FOREWORD


This volume is aimed at providing a clear and concise introduction 
to modern ways of conceptualizing the decision-making process,
ways that have provided powerful analytic tools for the complex 
decision-making tasks of today’s managers and organizations. 
Detailed treatments of the new decision-making techniques are 
available in several textbooks, but the student who is introduced 
to these ideas for the first time, and especially the student who 
does not intend to become a specialist in operations research, 
wants initially an overview of basic concepts and ideas. It is such 
an overview that Professor Kassouf provides in this volume. 


The volume was designed to be usable as the text in that section 
of an introductory course on foundations of management or 
administration devoted to decision making, and it was developed 
within the limits of size and technical level appropriate to that use. 
In time, we hope that it will be incorporated in a series treating 
in a similar manner a whole range of basic management concepts— 
a series that can serve as the text material for a full semester course 
on foundations of administration. Since Professor Kassouf’s
manuscript provides an exceptionally clear and useful introduction 
to normative theories of decision making, usable as an independent 
unit in a variety of contexts, we have thought that it should be 
made available to teachers and students as promptly as possible, 
and without waiting for other units of the prospective series. 

The interest and practical importance of normative decision 
theory is not limited to management situations where we can 
actually employ formal techniques like linear programming or 
Bayesian decision theory. Even in areas where we will continue, 
for a long time, to make our decisions in less formal and more 
qualitative fashion, the ideas underlying the formal theory—ideas 
of utility, constraints, probability, and so on—will be of great 
value in helping us to think clearly about complex problems. 
Formal decision theory is the logical foundation for a set of 
powerful mathematical techniques; but it is also a mode of thought,
a point of view that can illuminate many situations where we
may not see our way to the literal application of formal techniques. 

Decision theory today continues to develop vigorously and 
rapidly. New approaches (for example, so-called “heuristic”
methods) are attracting much attention and study. In this volume,
Professor Kassouf has not undertaken to depict this changing
scene in detail, but rather to focus on fundamental ideas that will 
remain central to an understanding of the decision-making process, 
whatever path new developments may take. The student whose 
interest in decision making is sparked by studying this introduction 
will find no dearth of new roads to explore and new worlds to 
conquer. 


Pittsburgh, Pennsylvania 


Herbert A. Simon 
INTRODUCTION


As free men we are forced to link desires with capabilities. We 
may do this impulsively or unconsciously but, willy-nilly, decisions 
must be made. How should an individual or group decide between 
alternative courses of action? This question will be explored in 
this book. 

How decisions should be made, as opposed to how decisions are 
made, involves normative analysis. A few decades ago many social 
scientists became disenchanted with normative theories because 
they seemed “unscientific.” For example, when an economist
advises that a tariff should be imposed on automobiles, he is
expressing a value judgment that discriminates against consumers in favor
of the auto industry. But the economist has no expertise in forming
value judgments; his competence as an economist does not make
his preferences superior to a layman’s. And so it was said that
normative or prescriptive theories should be abandoned by social
scientists because values are not subject to scientific analysis.
If a normative theory specifies values or goals, then there may
[...]. If, however, goals, values, or desires are given,
decision makers may usefully exploit “experts” who can
instruct them how they should behave. This will be our starting
point: we assume that the decision maker starts with explicit
values or goals and that he is to decide how to make the best
choice among available alternatives. This assumption eliminates
[...] behavior, because desires are almost
never given and are seldom clear. The “value [of intellect] to
any man lies in the speed and strength with which [...]
to clarify his desire and to act or build according to its dictates.”¹

[...] that institutional forces twist and distort an
individual’s preferences so that he no longer knows what is “good.”
John Kenneth Galbraith, in The Affluent Society, argues that
advertising in our society tends to lead people away from their
[...]. Some of us are [...] judges than others
[...] goals and desires, a proposition readily acceptable by
most when considering adults versus children or the mentally
[...]. [...] this dangerous proposition has too often led to
[...] abused power. In this book we
[...] pretend that the decision maker knows what
is “best.”

[...] we assume that desires are given, we shall have to justify
the statement, “You should do this to achieve your goal.” The
justification will be of the form that a rational person
would do this or that. This leads to the question: What do we
mean by “rational”? We shall show that if preferences are given
and if a few assumptions (or axioms) are valid for a decision
maker, then logical consequences will dictate his behavior if he
does not wish to act illogically. We will call this behavior rational.

¹ Jacques Barzun, The House of Intellect (New York: Harper & Row,
Publishers, 1959), p. 169.

By a normative theory we mean a theory that prescribes how 
decisions should be made, given goals and values. By rational 
behavior we mean behavior that is logically consistent. Thus, to 
say that goals cannot be established scientifically, or that the
theories are “value free” are not valid objections to the theories 
we present. 

We begin in the simplest of all worlds—the never-never land of 
complete certainty, where each decision maker knows precisely the 
consequence attached to every course of action available to him. 
We then move to situations that are best evident in the casino; 
each consequence cannot be known in advance, but we can place 
definite probabilities on the possible consequences. Moving still 
closer to the real world, we next consider decision situations in 
which either the probabilities associated with the outcomes are a 
matter of individual judgment, or no judgment at all can be made 
about them. Along the way we consider a specific application of 
decision making to portfolio selection. Further on we examine 
decision making under conflict, and finally, collective decision 
making. 

The treatment of these topics is neither completely rigorous nor 
completely intuitive but somewhere between these poles. Rigor 
was traded for informality to the point where (the author hopes) 
a beginner’s understanding will be optimized. Although much use
is made of symbols, the reader only needs knowledge of differential 
calculus. 

Each chapter could easily be expanded into a large volume. The 
main purpose of this work is not to treat any one topic in detail, 
but to show the framework connecting a number of areas in 
modern decision making.

At least one major area of decision theory has been omitted
entirely: situations in which the set of alternatives is not clearly
defined but must be found or designed. Until recently, this has
been a major soft spot in the state of the art. This problem is now
under concerted attack (see Ref. 8, Chap. 7). 


I am deeply grateful to Professor Herbert A. Simon for his
meticulous reading of the entire manuscript and his many useful
comments. He is not to be implicated for remaining
CHAPTER TWO


A decision situation exists if one must choose among alternative 
courses of action. For those of us who believe that the future is 
not completely predetermined, decision situations dominate our 
lives: of those schools that will accept me, in which shall I enroll; 
of those girls who will say “yes,” which shall I marry; of those jobs 
offered to me, which shall I take; of all the available houses, which 
shall I buy; etc. These decisions are often made impulsively, 
whimsically or, in some cases, after deliberate thought. In this 
chapter we begin to answer the question, “How should decisions
be made?” To erase the illusion of omnipotence, we stress at the 
outset that we will only advise the decision maker if he is rational 
and if he can reveal to us his values, terms which we now try to 
define. 


strategies, outcomes, and certainty 


Let us denote the courses of action, or strategies, in any
given situation by \(S_1, S_2, \ldots, S_n\) and the possible outcomes by
\(C_1, C_2, \ldots, C_r\). Note that several strategies may have the same
outcome. Strategies are means to ends; they interest us only because
they lead to goals that satisfy our desires. (Desires need not be 
selfish or hedonistic. A parent’s desire to send a son to college 
could involve working long, hard hours.) 

We can always define an outcome so that it contains all the 
aspects that will add or detract from our desires. For instance, a 
strategy available to me in a given situation might be \(S_1\), “go
downtown and buy a suit.” The outcome associated with this
strategy might be \(C_1\), “own a blue suit at a cost of $50.” But if the
act of going downtown is distasteful, then \(C_1\) might be defined as
“own a blue suit at a cost of $50 after an uncomfortable 45-minute
ride.” An outcome is so inclusively defined because we wish to
compare possible outcomes and choose a strategy solely on the 
basis of these outcomes. 

Whenever it is known in advance that every strategy will lead to 
an unequivocal outcome we say we are in a state of certainty. 
Conceptually, decisions under certainty are the easiest to cope 
with: choose that strategy which leads to your most desirable 
outcome. We shall examine in some detail the decision situation 
under certainty because it will lay the foundation for the more 
interesting and relevant case of uncertainty. 
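
The rule for decisions under certainty can be sketched in a few lines of Python. The strategy labels, the mapping from strategies to outcomes, and the preference ranks below are hypothetical illustrations rather than data from the text; a higher rank stands for a more desirable outcome.

```python
# Hypothetical example: under certainty each strategy leads to exactly one outcome.
outcome_of = {
    "S1": "C1",
    "S2": "C2",
    "S3": "C1",  # several strategies may share the same outcome
}

# Assumed ranking supplied by the decision maker (higher = more preferred).
rank = {"C1": 2, "C2": 1}


def best_strategies(outcome_of, rank):
    """Return every strategy whose outcome is the most preferred one."""
    top = max(rank[c] for c in outcome_of.values())
    return sorted(s for s, c in outcome_of.items() if rank[c] == top)


print(best_strategies(outcome_of, rank))
```

Because strategies matter only through their outcomes, the function never compares strategies directly; it compares outcomes and reports all strategies that reach the best one.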
rational decision makers

Much economic theory is concerned with states of certainty.
Almost all of this theory is dominated by a “rational” man
whose actions spring from the introspective question, “How shall
I behave in these circumstances?” It is not always clear whether
economic theories describe actual behavior or whether they
attempt to prescribe behavior. The desire to advise a decision maker
how he should behave does not guarantee that what follows is
descriptive of anybody’s behavior. We are only concerned with 
the rational decision maker; it is essential that we define this 
creature carefully. An individual who does not know his own tastes 
and preferences is beyond our help. He should come to us only 
after some combination of eugenic, parental, and societal influence 
has instilled in him distinct preferences that are “consistent.” 
Specifically, for a rational individual, the following axioms are 
valid. 

AXIOM 1. When faced with any two possible outcomes, \(C_i\) and \(C_j\),
a rational man will prefer \(C_i\) at least as much as \(C_j\), and/or \(C_j\) at least as
much as \(C_i\). (If \(C_i\) is preferred at least as much as \(C_j\), we write \(C_i \geq C_j\).
If \(C_i \geq C_j\) and simultaneously \(C_j \geq C_i\), we write \(C_i = C_j\).)

AXIOM 2. If \(C_i\) is preferred at least as much as \(C_j\) and \(C_j\) is preferred
at least as much as \(C_k\), then \(C_i\) is preferred at least as much as \(C_k\), where
\(C_i\), \(C_j\), and \(C_k\) are any three possible outcomes. (In the above notation:
If \(C_i \geq C_j\) and \(C_j \geq C_k\), then \(C_i \geq C_k\).)
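
For a finite set of outcomes, the two axioms can be checked mechanically. The sketch below is only an illustration: the function names, the small sample of money outcomes, and the cyclic hat preferences are assumptions introduced here, not part of the text.

```python
from itertools import product


def is_complete(outcomes, weakly_prefers):
    """Axiom 1: for every pair, at least one direction of weak preference holds."""
    return all(
        weakly_prefers(a, b) or weakly_prefers(b, a)
        for a, b in product(outcomes, repeat=2)
    )


def is_transitive(outcomes, weakly_prefers):
    """Axiom 2: a >= b and b >= c together imply a >= c."""
    return all(
        weakly_prefers(a, c)
        for a, b, c in product(outcomes, repeat=3)
        if weakly_prefers(a, b) and weakly_prefers(b, c)
    )


# Sums of money in cents (a small sample); more money is weakly preferred.
money = [100, 2600, 5100, 7600]


def more_is_better(a, b):
    return a >= b


# A hypothetical cyclic preference over three hats, for contrast.
hats = ["green", "feathered", "plain"]
cycle = {("green", "feathered"), ("feathered", "plain"), ("plain", "green")}


def cyclic(a, b):
    return a == b or (a, b) in cycle


print(is_complete(money, more_is_better), is_transitive(money, more_is_better))
print(is_complete(hats, cyclic), is_transitive(hats, cyclic))
```

The money ranking satisfies both axioms, while the cyclic hat preference is complete but violates Axiom 2.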

We are not claiming that rational individuals exist, but perhaps 
we should dispel the fear that we may be embarking upon an 
enterprise that will have no earthly usefulness. Surely some decision
situations exist in which these two axioms describe the vast
majority of us. For example, consider outcomes that involve only
sums of money ranging from $1 to $100 in increments of $.01.
There are roughly 10,000 different outcomes in this set, but for most of us
their ranking is straightforward and obeys the axioms. In many
situations, of course, where outcomes include many diverse dimensions,
the axioms might be questioned. For example, Axiom 1
seems to be violated very often when a woman is shopping—her 
hesitation suggests that she may not know whether she prefers the 
green hat to the feathered hat. And there is much evidence that 
Axiom 2 is often violated, especially when the outcomes are not 
of very great concern to the decision maker. Nevertheless, many 
decision situations probably exist in which the two axioms are 
valid. It is these situations that will concern us in this book. 


a consumer's decision problem 

Consider the decision situation facing a housewife in a
supermarket. The strategies available to her are staggering: the
purchase of every possible combination of food items that fall
within her total food budget. For instance, \(S_1\) might be “buy 100
quarts of milk” and \(C_1\), “family consumes 100 quarts of milk (in
given time period).” Hers is the classic problem of consumer
choice under certainty.¹ Without any loss in generality, we can
consider the somewhat simpler situation facing her husband, who is
contemplating the purchase of only steak and potatoes for a
barbecue.

¹ There is an implicit assumption that the housewife knows what she is
buying. If there is some doubt because of misleading advertising, then
there may be more than one possible outcome associated with each
purchase. This would no longer be a problem of choice under conditions
of certainty.

In addition to the rationality axioms, economists assume that,
within a relatively wide range, the consumer would prefer more
of a good to less of a good. (If he preferred less of a good, we
might then call it a bad rather than a good.) For simplicity, we will
make that assumption here. We wish to advise the consumer how
many pounds of steak and potatoes to buy. Let us first depict his
preferences graphically. In Fig. 2-1 we measure pounds of steak
on the horizontal axis and pounds of potatoes on the vertical axis. 
Every point in this quadrant represents some food basket of steak 
and potatoes. The point A, for example, represents the basket 
containing 5 pounds each of potatoes and steak. If we draw a 
vertical line and a horizontal line through this point, we have 
divided all possible food baskets into four regions. Region I, 
northeast of A, represents food baskets preferred to A because 
these baskets contain more of both steak and potatoes. Region III 
to the southwest of A represents food baskets that are not preferred
to A since these baskets contain less of both steak and
potatoes. All baskets (if any) that are exactly as desirable as A 
therefore lie in regions II or IV. We wish to discover all the 
baskets that this consumer believes equally as desirable as A. If 
we start with a basket containing 3 pounds of steak and 5 pounds 
of potatoes (point B), we know it is not preferred to A. But if we
keep adding potatoes to this basket (moving vertically from B in
Fig. 2-1), it is conceivable that we will reach a basket that this
consumer considers just as good as A. Say this happens when there
are 8⅓ pounds of potatoes in this basket containing 3 pounds of
steak (point C). In this way, starting with different amounts of
steak, we can find those amounts of potatoes that must be added
to make the combination as attractive as basket A. Connecting
all these points, we call the resulting curve an indifference
curve. All baskets lying on this curve are equally attractive. We 
know the curve must slope downward (because it must lie in 
regions II and IV). 

Proceeding in this way, we can fill the entire quadrant with
indifference curves, a complete collection of which is called a
consumer’s indifference map. (See Fig. 2-3.) These curves cannot
intersect as in Fig. 2-2, for if they did our axioms would be violated.
This can be seen by noting that baskets A and B are equally
desirable (in Fig. 2-2) because they lie on the indifference curve
labeled I. Baskets A and C are also equally desirable because they
lie on the indifference curve labeled II. Therefore, by Axiom 2,
B and C are equally desirable. But C lies to the northeast of B and
hence is preferable to B. Therefore, if our axioms and assumptions
are not to lead to contradictions, a rational man’s indifference
curves will not intersect.

We have yet to advise our by-now-impatient consumer which
basket to purchase. To do so, we must take into account his
available resources. If he were not restrained, he would merely choose
that basket with as much steak and potatoes as he could use. But
existing prices and the funds he has limit the strategies available
to him. Suppose that steak costs $1.50 per pound, potatoes $.50
per pound, and our consumer wishes to spend $15. He could
purchase the basket containing 10 pounds of steak and no potatoes,
denoted by A in Fig. 2-3. Or he could purchase the basket
containing 30 pounds of potatoes and no steak, denoted by B.
In fact, his $15 will purchase any of the baskets represented by
points on the line connecting A and B. This line, called the budget
line, is determined by the prices of steak and potatoes and by the
consumer’s resources. This line represents all the available strategies.
We would advise our man to choose that strategy which
results in his most preferred outcome, that is, that strategy which
places him on the highest possible indifference curve, because
higher curves mean greater preference.
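
The budget line can be written out as a small calculation. The sketch below uses the prices and budget given above; the function name and the use of exact fractions are choices made for this illustration.

```python
from fractions import Fraction

PRICE_STEAK = Fraction(3, 2)     # $1.50 per pound
PRICE_POTATOES = Fraction(1, 2)  # $0.50 per pound
BUDGET = Fraction(15)            # $15 to spend


def potatoes_on_budget_line(steak_lb):
    """Pounds of potatoes that exhaust the budget after buying steak_lb pounds of steak."""
    return (BUDGET - PRICE_STEAK * steak_lb) / PRICE_POTATOES


# Endpoint B (no steak), an interior basket, and endpoint A (no potatoes).
for steak in (0, 5, 10):
    print(steak, potatoes_on_budget_line(steak))
```

Each additional pound of steak costs as much as 3 pounds of potatoes, which is why the line falls 3 pounds of potatoes for every pound of steak.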

If his indifference map is as shown in Fig. 2-3, the most preferred
basket is represented by C, where his budget line is tangent to an
indifference curve. As the consumer moves down his budget line
from B, he is moving onto higher indifference curves.
At C he reaches the highest possible curve. As he moves further
downward toward A, he is moving onto lower indifference curves.
DECISION MAKING UNDER CERTAINTY

([...] then the solution to a decision problem would involve the purchase
of only one of the goods involved. Since in most situations it is
observed that consumers do not specialize in the purchase of one
good, it is then assumed that the indifference curves are convex.)

Let us review our procedure. We asked a decision maker to
reveal his preferences or indifference map and the strategies
available to him, that is, the size of his budget. Then we advised him
to choose the basket he preferred most. In general, our advice
under conditions of certainty will always be the same: choose that
strategy which has the most preferred outcome. Under these
circumstances, why retain an adviser?

There might be some value to the decision maker in this procedure.
Which basket should he choose if the prices of the goods
change, or if his budget changes? Once his preference pattern is
known, these questions can be answered quickly and he need not
expend energy comparing the available alternatives. There are
many computational devices that will yield the correct answer:
the geometrical technique we just used and some algebraic processes,
notably the use of Lagrangian multipliers. (See Ref. 2.) So
decision making under certainty is not a completely vacuous affair.
However, the real purpose of our close scrutiny is the help it will
give us in advising decision makers when the outcome for a
strategy is not known with certainty.


utility functions 

Suppose that outcome A is preferred at least as much 
as outcome B. If, in addition, outcome B is preferred at least as 
much as outcome A, we say that A and B are equivalent. All of the 
outcomes that are equivalent to A comprise an equivalence class. 
For example, all the outcomes represented by an indifference curve 
(in Fig. 2-3, say) are equivalent, and each indifference curve
represents an equivalence class. The indifference map partitions all
of the outcomes in a given situation into equivalence classes. If
an outcome in equivalence class II is preferred to an outcome in
equivalence class I, then every outcome in class II is preferred to
every outcome in class I. In this case, we say that equivalence
class II is preferred to equivalence class I. For example, in Fig.
2-3, the equivalence class represented by the indifference curve 
labeled II is preferred to the equivalence class represented by the 
indifference curve labeled I. 

If we assign real numbers to each equivalence class so that the 
more preferred the equivalence class the higher the number, the 
assignment is called one of the decision maker’s utility functions. 
A utility function is then a rule that associates with each outcome
a real number so that if \(C_1\) is preferred to \(C_2\), then the number
assigned to \(C_1\) is greater than the number assigned to \(C_2\). For
example, a utility function might assign the numbers 5, 6, and 7
to indifference curves I, II, and III, respectively, in Fig. 2-3.
Another utility function, expressing exactly the same decision
maker’s preferences, might assign the numbers 24, 34.56, and 678
to these curves. Only the order of the numbers is relevant in
defining these functions, which are called ordinal utility functions.
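
That only the order of the numbers matters can be seen by comparing the two assignments just mentioned. In the sketch below the dictionary layout and the helper name `ranking` are assumptions made for illustration; the numbers are those of the text.

```python
# Two numberings of indifference curves I, II, and III.
u1 = {"I": 5, "II": 6, "III": 7}
u2 = {"I": 24, "II": 34.56, "III": 678}


def ranking(utility):
    """Order the equivalence classes from least to most preferred."""
    return sorted(utility, key=utility.get)


print(ranking(u1))
print(ranking(u1) == ranking(u2))
```

Both assignments order the curves identically, so they express the same preferences even though the numbers themselves differ.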

Axioms 1 and 2 assure us that a utility function exists under
most general conditions.² If this function can be expressed neatly,
then a decision problem can be solved with little
effort, no matter how the restraints operating on preferences
change. For example, after plotting some points of indifference,
the consumer in the barbecue example above might have found
that his preferences could be summarized in the following way:
to a basket containing x pounds of steak and y pounds of potatoes,
assign the number x times y. This number might then express his
preference for a basket relative to other baskets. If one basket had
the number 25 assigned and another the number 9, then the basket
with the higher number is preferred. The number attached to each
basket is called the utility of the basket. It is important to realize
that the utility of an outcome is a consequence of the decision
maker’s preference. It is merely a shorthand method of describing
his preference.

² We could assume a third axiom involving certain topological considerations
that would ensure the existence of a utility function under all conditions.
Because the exceptions are somewhat pathological it would be
unrewarding to include such an axiom at this level of treatment. For
a rigorous statement on the existence of a utility function see Gerard
Debreu, “Representation of a Preference Ordering by a Numerical
Function,” Decision Processes, R. M. Thrall, C. H. Coombs, and R. L.
Davis, eds. (New York: John Wiley & Sons, Inc., 1954), pp. 159-65.

In Fig. 2-1, the indifference curves could be described by the 
utility function which assigns to each basket the product of the 
number of pounds of steak by the number of pounds of potatoes. 
For example, the utility for point A representing 5 pounds each of 
potatoes and steak is 5 times 5, or 25. Note that point C also has 
utility 25. This utility function would be written \(U(x, y) = xy\).³
The indifference curve passing through point A could thus be 
labeled 25. The curve passing through point B in Fig. 2-1, could 
be labeled 15. Given this definition of utility, our advice to the 
decision maker can be summarized: Maximize your utility! This 
becomes a simple engineering problem if his utility function can 
be expressed mathematically. Herein lies the power and usefulness 
of the procedure outlined in this chapter. Maximizing techniques 
under constraints are well developed; some maximization methods
use Lagrangian multipliers and linear and nonlinear programming.
As a simple example let us solve the decision problem of
Fig. 2-3 analytically. 


an example of utility maximization 

Since we know the y-axis intercept of the budget line
equals 30 and its slope equals −3, the equation of the budget line
in Fig. 2-3 is

\[ y = 30 - 3x \tag{1} \]

where y is pounds of potatoes and x is pounds of steak. We also
know the solution occurs at the point of tangency between an
indifference curve and this budget line. The equation of any
indifference curve may be written

\[ xy = \text{constant} = C \tag{2} \]

or

\[ y = C/x \]

where C is the real number assigned by the utility function to each
equivalence class, or indifference curve. Differentiating this for the
slope of the curve yields \(-C/x^2\). The solution occurs at the point
where this slope is equal to the slope of the budget line:

\[ -C/x^2 = -3, \quad \text{so that} \quad C = 3x^2 \]

Substituting in (2) yields \(y = 3x\) at the point of tangency.
Substituting this into (1) yields

\[ 3x = 30 - 3x, \quad \text{or} \quad x = 5 \]

Again substituting into (1) we have that \(y = 15\). This consumer’s
optimal strategy is to purchase 5 pounds of steak and 15 pounds
of potatoes. This is his most preferred outcome from all the
available alternatives. His utility will be \(5 \times 15 = 75\).

³ In this case, since each outcome contains two goods, the utility function is
a real-valued function of two variables.
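
The tangency solution can be cross-checked numerically. The sketch below assumes the utility function U(x, y) = xy and the budget line y = 30 − 3x from the text; the grid of candidate steak quantities (steps of one hundredth of a pound) is an assumption made only for this illustration.

```python
from fractions import Fraction


def utility(x, y):
    """U(x, y) = xy, the utility function used in the example."""
    return x * y


def potatoes(x):
    """Budget line: y = 30 - 3x."""
    return 30 - 3 * x


# Candidate steak quantities from 0 to 10 pounds in steps of 1/100 pound.
grid = [Fraction(k, 100) for k in range(0, 1001)]

# Pick the point on the budget line with the highest utility.
best_x = max(grid, key=lambda x: utility(x, potatoes(x)))
best_y = potatoes(best_x)

print(best_x, best_y, utility(best_x, best_y))
```

A search of this kind gives no insight into why the optimum lies at the tangency, but it is a convenient check on the algebra.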

Had we used any other utility function that maintained the same
preference ordering, that is, a utility function that assigned any
other numbers to the curves provided that higher curves had
higher numbers, this analysis would have resulted in the same
solution. For example, the same preferences could be summarized
by the utility function that assigns to each basket the utility 2xy,
or \(U(x, y) = 2xy\).⁴ This utility function would assign numbers
twice as large as the original function considered in the above
example, but would be maximized by the choice of the same
basket.

⁴ Any monotonic increasing function of U is again a utility function.
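
The invariance of the chosen basket under monotonic increasing transformations can be illustrated with a short search along the budget line y = 30 − 3x. The functions xy and 2xy come from the text; log(xy) is an additional monotonic transformation assumed here for illustration, and the grid of steak quantities is likewise an assumption of this sketch.

```python
import math
from fractions import Fraction


def potatoes(x):
    """Budget line from the example: y = 30 - 3x."""
    return 30 - 3 * x


# Steak quantities strictly between 0 and 10 pounds, so that xy > 0 and log is defined.
grid = [Fraction(k, 100) for k in range(1, 1000)]

utilities = {
    "xy": lambda x, y: x * y,
    "2xy": lambda x, y: 2 * x * y,
    "log(xy)": lambda x, y: math.log(x * y),  # assumed extra example
}

# For each utility function, find the steak quantity that maximizes it on the budget line.
best = {}
for name, u in utilities.items():
    best[name] = max(grid, key=lambda x, u=u: u(x, potatoes(x)))

for name, x in best.items():
    print(name, x, potatoes(x))
```

All three functions select the same basket; only the utility number attached to it changes.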


more decision making under certainty 

Another example of decision making under certainty
is the so-called diet problem. Suppose the foods available to a
housewife are designated by \(x_1, x_2\), etc., and that she is concerned
with the amounts of nutrients, a, b, c, ..., that any basket of
food will yield. Every basket can be designated by \((q_1, q_2, \ldots)\)
where \(q_i\) is the amount of \(x_i\) in the basket. If \(a_1\) is the amount of
nutrient a in a unit of \(x_1\), \(a_2\) the amount of nutrient a in a unit of \(x_2\),
etc., then the amount of a in a particular basket is

\[ a_1 q_1 + a_2 q_2 + \ldots + a_n q_n \]

The amount of b in this basket would be a similar expression with
\(b_i\)’s replacing the \(a_i\)’s. It is sometimes said that there is a diet
which meets minimum nutritional requirements measured in terms
of the nutrients a, b, c, .... Suppose a housewife wishes to meet
this minimal requirement at least cost. If a unit of \(x_i\) costs \(p_i\), then
the cost of any basket is

\[ p_1 q_1 + p_2 q_2 + \ldots + p_n q_n \tag{3} \]

Her problem, then, is to find the least expensive basket satisfying
the minimal diet requirements, that is, to minimize (3) subject to
inequalities of the form

\[ a_1 q_1 + a_2 q_2 + \ldots + a_n q_n \geq a \]
\[ b_1 q_1 + b_2 q_2 + \ldots + b_n q_n \geq b \]

The technique of linear programming can be used to find a solution.
In cases where the expression to be minimized is nonlinear,
nonlinear programming may be used.
It might be useful to look at this problem in terms of the utility
analysis developed earlier. To simplify matters, consider only two 
nutrients, a and b, and two foods, x and y. Suppose that 1 pound 
of x costs $2 and contains 4 units of a and 2 units of b, and 
1 pound of y costs $3 and contains 3 units of a and 6 units of b. 
A nutrition expert declares that a minimal diet should contain 12 
units of a and 18 units of b. A housewife perceives the decision 
situation facing her in this way: to purchase that basket of x and y 
that supplies at least the minimal diet at minimum cost. In Fig. 
2-4 different combinations of a and b are represented by points 
in the quadrant. Point A represents 12 units of a and 18 units of b,
the minimal diet. The ray from the origin marked X represents
the combinations of a and b that can be obtained solely by buying
x; the ray marked Y represents the corresponding combinations
obtained by buying y.

FIG. 2-4

Again we can represent a budget line by a straight line in this 
quadrant. For example, consider the ways in which $6 can be 
spent on x and y. One way is to purchase 3 pounds of x, yielding 
12 units of a and 6 units of b. This choice is represented by point B 
in Fig. 2-4. Another way is to purchase 2 pounds of y, yielding 
6 units of a and 12 units of b. This is represented by point C. 
The straight line connecting B and C represents all the possible 
nutritional yields arising from baskets of food containing x and y
that cost $6. Every budget line will have the same slope as this
line and will be bounded by the two lines rising from the origin.
The first step is to find the budget line that goes through A. This
is the $10 budget line; any of the points on the line connecting D with E can
be attained for $10. For instance, D can be attained by purchasing
only y and E by purchasing only x. The distance from D to A
is ⅕ the distance of D to E, so to purchase the basket yielding A,
⅕ of the budget should be spent on x ($2) and ⅘ on y ($8). That
is, 1 pound of x should be bought yielding 4 units of a and 2 units
of b, and 2⅔ pounds of y should be bought yielding 8 units of a
and 16 units of b. This is the minimum satisfactory diet at least
cost.
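
The same answer can be reached by treating the two-food example as a small linear program. The sketch below is not a general linear programming routine: it relies on the fact that, with only two foods, a least-cost basket can be found among the corner points where two constraint boundaries cross. The function names and the constraint encoding are assumptions made for this illustration; the prices, nutrient contents, and requirements are those of the example.

```python
from fractions import Fraction
from itertools import combinations

# Each constraint (c_x, c_y, rhs) means c_x*q_x + c_y*q_y >= rhs,
# where q_x and q_y are pounds of foods x and y.
constraints = [
    (4, 3, 12),  # nutrient a: at least 12 units
    (2, 6, 18),  # nutrient b: at least 18 units
    (1, 0, 0),   # q_x >= 0
    (0, 1, 0),   # q_y >= 0
]
price = (2, 3)   # dollars per pound of x and of y


def intersection(c1, c2):
    """Point where two constraint boundaries cross, or None if they are parallel."""
    (a1, b1, r1), (a2, b2, r2) = c1, c2
    det = a1 * b2 - a2 * b1
    if det == 0:
        return None
    return (Fraction(r1 * b2 - r2 * b1, det), Fraction(a1 * r2 - a2 * r1, det))


def feasible(point):
    """True if the basket satisfies every constraint."""
    return all(cx * point[0] + cy * point[1] >= rhs for cx, cy, rhs in constraints)


def cost(point):
    return price[0] * point[0] + price[1] * point[1]


# Collect the feasible corner points and pick the cheapest.
vertices = []
for c1, c2 in combinations(constraints, 2):
    p = intersection(c1, c2)
    if p is not None and feasible(p):
        vertices.append(p)

cheapest = min(vertices, key=cost)
print(cheapest, cost(cheapest))
```

The cheapest corner is the basket with 1 pound of x and 2⅔ pounds of y at a cost of $10, in agreement with the geometric argument.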

Posing the problem in this way leads to an interesting preference 
pattern for the decision maker. In Fig. 2-5, all baskets yielding less 
than 12 units of a or 18 units of b are less preferable than A. In 
addition, all baskets yielding more than 12 units of a and 18 units 
of b are preferred at least as much as A. If these latter baskets are 
preferred more than A, then the indifference curve passing through 
A consists of the vertical line segment above A and the horizontal 
line segment to the right of A. (See Fig. 2-5.) The problem can 
then be stated: find the minimum cost in attaining this given level 
of nutrition. (In the earlier consumer choice problem, the budget
was given and the problem was to find the highest level of preference
attainable.)

This characterization of the diet problem assumes complete 
independence of each of the nutritional requirements. If one
nutrient could be substituted for another, it might be better to
return to a formulation in terms of “smooth” indifference curves,
instead of the sharp dog-leg of Fig. 2-4. Let us see how these
alternative assumptions about utility relate to each other. Suppose 
that different levels of nutritional adequacy exist, and that for 
most relevant ranges, trades can be made between nutrient a and 
nutrient b without sacrificing the nutritional level. For instance, 
starting with A in Fig. 2-5, suppose a nutritional expert declared 
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that if 1-g- units of a were removed from the combination and 
3 units of b added, the level of nutritional adequacy would be 
maintained. This is represented by point B. The same level of 
nutrition might be described by an indifference curve similar to 
the one passing through A and B. The new problem might then 
be stated: given the level of nutrition denoted by this curve, attain 
it at least cost. This is now the familiar problem of finding the 
budget line tangent to this curve and purchasing that basket rep¬ 
resented by the point of tangency. If substitution between nutrients 
is defined by smooth indifference curves, then this latter method is 
more efficient than the linear progra mmi ng technique. The solu¬ 
tion would now be represented by C in Fig. 2-5, which is on a 
lower budget line. That is, the same level of nutrition can be 
reached at lower cost if nutrients can be traded off against each 
other in some ranges. 

Although this latter method is more efficient, it requires more 
information than the linear programming problem. It is necessary 
to define all the indifference curves, or levels of equal nutrition. 
The linear programming problem might be considered an
approximation to the real problem and useful because it introduces a
certain rule of thumb (the minimum diet) which renders decision 
making easy. When problems are very complex, introduction of 
rules of thumb may be justified in order to make the computations 
relatively easy. It is often difficult to determine how much this 
computational ease costs in terms of efficiency. 

summary 

If Axioms 1 and 2 are valid for a decision maker, we 
say he is rational. These axioms insure that he can order all possible 
outcomes when faced with a decision situation. If each possible 
strategy leads unequivocally to a single outcome, then he should 
choose that strategy which leads to his most preferred outcome. 
This would be trivial; but because the addition of some few
assumptions to the rationality axioms leads to powerful
computational techniques for solving a decision problem, this advice may
be very helpful to decision makers.

Some of the devices used to select a strategy for a rational man 
are geometrical or algebraic, using Lagrangian multipliers or 
linear programming theory. After the decision maker reveals his 
preferences and constraints, these techniques can be brought into 
play to pinpoint the strategy he should follow. In this process the 
advisor brings no value judgments to the scene, except perhaps 
the criterion that the decision maker should receive his most
preferred outcome.

Under most conditions, the rationality axioms insure the
existence of a utility function: a rule that associates with each outcome
a real number so that of any two outcomes, the more preferred
has a higher number associated with it. (See Footnote 2.) If the
decision maker attains a level of satisfaction measured by the
utility of the outcome, then our advice can be rephrased:
Maximize your utility!


PROBLEMS 


1. A consumer wishes to purchase some combination of eggs and milk; 
the price of eggs is $1 per dozen, and the price of milk is $.50 per 
quart. The consumer wishes to spend $5 for this combination. His 
utility for a combination of X dozen eggs and y quarts of milk is X-y. 
What combination of eggs and milk should he purchase? 

2. In Problem 1, if his utility function is log (X*y), what combination 
should he purchase? 

3. Suppose a consumer preferred combination A to B whenever A
contained more eggs or whenever A and B contained the same
number of eggs but A had more milk. What would his indifference map
look like?


DECISION MAKING WITH OBJECTIVE PROBABILITIES


Man rarely inhabits the land of complete certainty. And perhaps
it is a good thing. Although he seems intolerant of extreme
insecurity (as evidenced by his concern for religion, science, labor
unions, unemployment insurance, cartels, price fixing), it would
be a dull existence if he knew with certainty the consequence of
every act. Carried too far, security squeezes out many of the
pleasant surprises and joys of anticipation. But we need not fret.
Man may never achieve the ability to predict perfectly.


objective probabilities

Let us take one step toward reality by considering the
following decision situation: each available strategy has one or
more possible outcomes and the probability of each outcome is
known. Gambling provides many examples. The toss of a coin
can result in two possible outcomes, heads or tails. For most
coins, and under most circumstances, we are willing to assume
that the two outcomes are equally probable. This is expressed
by the statement: the probability of heads is 1/2 and the probability
of tails is 1/2. This assignment of numbers (between 0 and 1) to
possible outcomes may be based upon a physical examination of
the coin, or upon observing repeated tosses, or both. One
interpretation of this probability statement is that if the coin is
repeatedly tossed, the proportion of heads (or tails) in the total
number of tosses will approach 1/2. Under this interpretation,
outcomes have probabilities in much the same way that physical
objects have certain characteristics. We hypothesize that this book
has some specific weight. Careful testing with equipment might
lead us to state that the weight is 9.3 ounces. But no amount of
examination can lead us to say that the weight is exactly 9.3
ounces, because we have no instruments that can differentiate
between 9.29999999999999 and 9.3 ounces. Similarly, examination
of a coin and the conditions under which it is tossed may lead us
to say that the probability of heads is 1/2, but we can never be sure
that it is exactly 1/2; this would require an infinite number of tosses.

In the case where many observers would be led to attach
probability of 1/2, or some number very close to 1/2, to the outcome
heads, we say that the probability is objective. Objective
probabilities, like physical weights, do not entirely remove human
judgment. Two reasonable men, with exactly the same evidence,
might arrive at different probabilities for “heads.” But in the case
of a coin these differences will usually be very small. Let us
overlook the differences and assume in this chapter that reasonable
men will arrive at the same probabilities in a given decision
situation; we call this decision making with objective probabilities.

the decision situation 

Again we denote strategies by S_1, S_2, ..., and outcomes
by C_1, C_2, .... In addition, we must indicate the probability of
each outcome. This can be represented conveniently in matrix
form. The columns of the matrix refer to outcomes, the rows to
strategies, and the entries in the body of the matrix to the objective
probabilities. For example, the following matrix characterizes a
decision situation with 3 available strategies and 4 possible
outcomes. If S_1 is chosen, C_2 will occur with certainty; the probability
of C_2 is 1 and the probability of all other outcomes is 0. The entry
in the first row (referring to S_1) and the second column (referring
to C_2) is 1. All other entries in the first row are 0. If S_2 is chosen,
each outcome has probability 1/4. For S_3, each of the outcomes
C_1, C_2, C_3, and C_4 occurs with the probability entered in the
third row. Note that the probabilities in every row sum to 1,
indicating that these 4 outcomes exhaust all possibilities for each
strategy.

MATRIX 2.1: Objective Probabilities for a Given Decision Situation. 

                            Possible outcomes

                         C_1     C_2     C_3     C_4

              S_1         0       1       0       0
Available
strategies    S_2        1/4     1/4     1/4     1/4

              S_3        [entries illegible]


This matrix of objective probabilities can be interpreted: after
some strategy (say S_2) is chosen, a chance mechanism (say a
roulette wheel with only 4 numbers) determines the outcome; if S_2
is chosen repeatedly, the chance mechanism will select each
outcome about 1/4 of the time.
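
The chance-mechanism reading of the matrix can be sketched in a few lines. Only the rows for S_1 and S_2 are used, since those are fully specified in the text; the names PROBABILITIES, is_valid_row, and simulate, and the fixed random seed, are assumptions made for illustration.

```python
import random
from collections import Counter
from fractions import Fraction

OUTCOMES = ["C1", "C2", "C3", "C4"]

# Rows S1 and S2 of the matrix, as described in the text.
PROBABILITIES = {
    "S1": [Fraction(0), Fraction(1), Fraction(0), Fraction(0)],
    "S2": [Fraction(1, 4)] * 4,
}


def is_valid_row(row):
    """A row is valid when every entry lies in [0, 1] and the row sums to 1."""
    return all(0 <= p <= 1 for p in row) and sum(row) == 1


def simulate(strategy, plays, seed=0):
    """Relative frequency of each outcome over repeated plays of one strategy."""
    rng = random.Random(seed)
    weights = [float(p) for p in PROBABILITIES[strategy]]
    draws = rng.choices(OUTCOMES, weights=weights, k=plays)
    counts = Counter(draws)
    return {c: counts[c] / plays for c in OUTCOMES}


print(all(is_valid_row(row) for row in PROBABILITIES.values()))  # True
print(simulate("S2", 100_000))  # each frequency should be close to 0.25
```

With a large number of plays the observed frequencies for S_2 settle near 1/4, which is the long-run interpretation given above.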

expected value of a strategy

Suppose an individual had to choose between S_1 and S_2
represented in the following matrix. Choosing S_1 means playing
the game where a coin is tossed and the individual wins $5 if a
head appears and he loses $4 if a tail appears; and S_2 is choosing
not to play.

MATRIX 2.2: Coin Toss Game. 



                         C_1 =      C_2 =      C_3 =
                         win $5     lose $4    no gain
                                               or loss

Play the game = S_1       1/2        1/2          0

Don’t play = S_2           0          0           1


One way to evaluate these strategies is to calculate what would
happen if either of them were chosen repeatedly. If S_2 were always
chosen when the situation presented itself, the cumulative
outcome, of course, would be that the decision maker would
experience neither a gain nor a loss, for the outcome at every play would
be 0. If S_1 were always chosen, after a large number of plays the
associated probabilities indicate that about 1/2 of the time a gain
of $5 would result and the other half a loss of $4. For example,
if S_1 were chosen 100 times, in about 50 cases a gain of $5 would
result for a cumulative gain of $250, and in the remaining cases
a loss of $4 would result for a cumulative loss of $200. After 100
plays, this would result in a net gain of about $50, or an average
gain per play of $.50. The larger the number of plays, the closer
will the average gain per play be to $.50. The average long run
experience of a strategy is called the mathematical expectation of
the strategy, or the expected value of the strategy. This is calculated
by multiplying each outcome by its associated probability and
adding all the resultant products. For example, the expected value
of S_1 is (1/2)(5) + (1/2)(-4), or 2.5 - 2 = 0.5. This is written:

    E(S_1) = 0.5
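
The calculation can be written out as a small sketch. A strategy is represented here as a list of (probability, payoff) pairs; that representation and the name expected_value are choices made for this illustration.

```python
from fractions import Fraction


def expected_value(strategy):
    """Sum of probability * payoff over the (probability, payoff) pairs."""
    return sum(p * x for p, x in strategy)


half = Fraction(1, 2)
play = [(half, 5), (half, -4)]   # S_1: win $5 or lose $4
dont_play = [(Fraction(1), 0)]   # S_2: no gain or loss

print(expected_value(play))       # 1/2
print(expected_value(dont_play))  # 0
```

Exact fractions are used so that the result, 1/2 (that is, $.50 per play), is not blurred by rounding.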
Since expected value represents the average result over a long
period of play, it once seemed reasonable that strategies should be
ranked on the basis of this calculation; that strategy with highest
expected value should be chosen. This criterion would choose S_1
over S_2 in the above example. But there are individuals, even some
who are not innocent of probability theory, who would not play
the game; they would choose S_2. Before they are condemned, let
us look at an example where the criterion of expected value breaks
down completely.

the Petersburg paradox 

Almost from the start it was recognized that the criterion
of expected value would lead to bizarre choices in some
circumstances. Consider the Petersburg game. The rules are simple:
a coin is repeatedly tossed and the game ends with the
appearance of a head. If a head appears on the first toss and the
game ends, you win $2; if the game ends on the second toss, you
win $2^2 = $4; if the game ends on the nth toss, you win $2^n.
Suppose the fee to play this game is your entire wealth. You are
offered the chance to play (call this strategy S_1) or not to play
(call this S_2). The expected value of S_2 is, of course, 0.

To calculate the expected value of S_1 we must know the
probabilities of each possible outcome. Assuming a fair coin, the
probability that the game will end on the first toss is 1/2; the
probability of it ending on the second toss is (1/2)(1/2) = 1/4; the
probability of it ending on the third toss is (1/2)(1/2)(1/2) = 1/8; the
probability of it ending on the nth toss is (1/2)^n. If we multiply these
probabilities by their respective outcomes (payoffs) and add, we have

    (1/2)(2) + (1/4)(4) + (1/8)(8) + ... + (1/2^n)(2^n) + ... = 1 + 1 + 1 + ...

The expected value of S_1 is thus this infinite sum less your total
wealth. If you apply the criterion of expected value, you would
choose S_1 over S_2; you would risk your total wealth (less $2) and
play the game. If a head appeared on the first toss, your total
wealth would be reduced to $2. This is contrary to what most
individuals would do (except perhaps for those whose total wealth
is very tiny), so the criterion of expected value is suspect.
Essentially, expected value ignores the risk of a strategy.
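
The divergence of the sum is easy to see numerically. The sketch below counts only those games that end within a given number of tosses; the function name is illustrative.

```python
from fractions import Fraction


def petersburg_partial_expectation(max_tosses):
    """Expected payoff counting only games that end within max_tosses tosses.

    The game ends on toss n with probability 1/2^n and then pays 2^n dollars,
    so every term of the sum equals 1.
    """
    return sum(Fraction(1, 2**n) * 2**n for n in range(1, max_tosses + 1))


for n in (1, 10, 100):
    print(n, petersburg_partial_expectation(n))
# 1 1
# 10 10
# 100 100
```

Each additional toss allowed adds exactly $1 to the expectation, so the sum exceeds any fee, however large.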

For instance, if one strategy promised either $10 or $20, each
with probability 1/2, its expected value would be $15. But a strategy
promising a loss of $100 or a gain of $130, each with probability
of 1/2, would also have an expected value of $15. Applying the
criterion of expected value, an individual would be indifferent as
to preference between these two strategies. Most individuals,
however, would have definite preferences between them even though
the two have equal expectations. Those who prefer the first might
be called risk averters; those who prefer the second, risk lovers.
In a world where individual preferences are free to be expressed,
it is improper to call either risk averters or risk lovers wrong or
irrational. We need a more subtle criterion that will allow
individual attitudes toward risk to enter into the decision. The next
section provides this criterion.
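
A brief sketch makes the point concrete. The two strategies are those of the preceding paragraph; the standard deviation is used here only as one convenient illustration of how widely the outcomes are spread, not as a criterion proposed in the text.

```python
import math
from fractions import Fraction


def mean(strategy):
    """Expected value of a list of (probability, payoff) pairs."""
    return sum(p * x for p, x in strategy)


def standard_deviation(strategy):
    """Square root of the probability-weighted squared deviation from the mean."""
    m = mean(strategy)
    variance = sum(p * (x - m) ** 2 for p, x in strategy)
    return math.sqrt(float(variance))


half = Fraction(1, 2)
modest = [(half, 10), (half, 20)]
risky = [(half, -100), (half, 130)]

print(mean(modest), mean(risky))                              # 15 15
print(standard_deviation(modest), standard_deviation(risky))  # 5.0 115.0
```

The expectations are identical, yet the outcomes of the second strategy lie much farther from $15 than those of the first.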


normative decisions with objective probabilities 

The general situation involving objective probabilities
can be described by the following matrix. (We do not lose much
in generality by assuming that there are only a finite number of
basic outcomes C_1, C_2, ..., C_n. Note that in this matrix the
number of available strategies may be infinite.)

MATRIX 2.3: General Decision Situation.

                   Basic outcomes or consequences

                     C_1      C_2      ...     C_n

              S_1    p_11     p_12     ...     p_1n
Available     S_2    p_21     p_22     ...     p_2n
strategies     .       .        .                .
              S_r    p_r1     p_r2     ...     p_rn

p_ij = objective probability of outcome C_j if strategy S_i is chosen

Again we will define a “rational” man as one who adheres to a 
few assumptions or axioms. We introduce 4 axioms from which 
we can deduce an algorithm for ranking strategies. 

AXIOM 1. The individual has distinct and consistent preferences,
that is, for any two strategies, S_i and S_j, either S_i is preferred to S_j, or
vice versa, or both. (If each is preferred to the other we again say that
they are equivalent, or that S_i is indifferent to S_j, and we write S_i = S_j.)
Further, if S_i > S_j and S_j > S_k, then S_i > S_k.

A strategy may be denoted as a vector

    S_j = (p_j1 C_1, p_j2 C_2, ..., p_jn C_n)

We distinguish n strategies (which may or may not actually be
available to the decision maker) in which all the objective
probabilities are 0 except for the one which is 1. These are the pure
strategies that correspond to complete certainty. For example,
S*_i = (0 C_1, 0 C_2, ..., 1 C_i, ..., 0 C_n). Since S*_i leads unequivocally
to C_i as the outcome, we can replace C_i whenever it occurs in an
expression by S*_i. That is, we can identify any outcome with that
pure strategy which will result in that outcome with certainty.

If Axiom 1 is valid, we can rank all the pure strategies. Since
the numbering of strategies is arbitrary, let S*_1 be the most
preferred among the pure strategies, S*_2 the second most preferred,
etc., with S*_n the least preferred. This numbering of course
automatically ranks the basic outcomes: C_1 > C_2 > ... > C_n.

If only pure strategies are available in a given context, then the
decision problem is reduced to one of complete certainty of the
kind that has been explored in Chap. 2. Obviously, if S*_1 actually
appears among the available strategies, it will be the most
preferred.

AXIOM 2. Every pure strategy is equivalent to a unique mixed
strategy involving only C_1 and C_n, that is, for every S*_i there exists a
unique probability u_i so that S*_i = (u_i C_1, 0 C_2, 0 C_3, ..., [1 - u_i] C_n).
(This last strategy can be written [u_i C_1, (1 - u_i) C_n], it being understood
that when any of the objective probabilities is 0, that term will not
appear in the vector.) The decision maker will be indifferent as to choice
between the pure strategy S*_i and the mixed strategy whose outcomes are
C_1 with probability u_i and C_n with probability (1 - u_i). The probability
u_i is called the utility of C_i.


Since this axiom is crucial to the development that follows, it will
be illustrated with an example. Suppose C_1 represents a gain of
$100, C_2 a gain of $99, ..., C_100 a gain of $1. This would be a
decision situation in which the basic outcomes are gains ranging
from $1 to $100, with higher gains preferred to lesser gains.
S*_76 is the pure strategy promising $25 with complete certainty. Let
S_p = [p ($100), (1 - p) ($1)] be any strategy involving only the best
and worst outcomes. From Axiom 1 we know that S*_76 and S_p
can be compared. If, for example, p = 0.01, so that S_p represents
the strategy that will result in a gain of $100 with 0.01 probability
and $1 with 0.99 probability, most individuals would probably
prefer S*_76. That is, most would prefer $25 with certainty rather
than a small chance at winning $100 and a large chance of winning
$1. (What is the expectation of S_p?) Now if p is continuously
increased, Axiom 2 states that some probability will be reached at
which S_p is indifferent to S*_76; and if this point is exceeded, then S_p
will be preferred to S*_76. For example, if p = 0.99, most would
probably prefer S_p over the certainty of $25, since S_p now promises
$100 with a very large probability.

Suppose when p = 0.5, a decision maker is indifferent as to
choice between S_p and S*_76, that is, a decision maker is indifferent
between receiving $25 with certainty and a strategy that will
return either $100 or $1, both with probability 1/2. (In our notation,
S*_76 = (1/2 · 100, 1/2 · 1) = [u_76 · 100, (1 - u_76) · 1].) Then the
probability u_76 = 1/2 is called the utility of $25. Note that this
probability, or utility, does not refer to the uncertainty introduced by
nature but to the preferences of the decision maker.

Axiom 2 implies delicate discriminatory powers on the part of
the decision maker. Except where the basic outcomes lend
themselves to easy discrimination, such as when the basic outcomes
involve sums of money, it is sometimes said this axiom is
unrealistic and may not be true.

AXIOM 3. In any strategy, the component strategy C_i can be
replaced by its equivalent strategy involving only C_1 and C_n.

Recall that the pure strategy C_i can be replaced by S*_i, which is
equivalent to a unique mixed strategy [u_i C_1, (1 - u_i) C_n]. Axiom 3
asserts that this last strategy can replace C_i in any strategy. For
example, consider an arbitrary strategy, S = (p_1 C_1, p_2 C_2, ..., p_n C_n).
If we replace each C_i with its equivalent mixed strategy, we have

    S' = {p_1 [u_1 C_1, (1 - u_1) C_n], p_2 [u_2 C_1, (1 - u_2) C_n], ...,
          p_n [u_n C_1, (1 - u_n) C_n]}


The strategy S' is called a compound mixed strategy, or
simply a compound strategy, since each “outcome” C_i can itself
be represented as a mixed strategy which then leads to one of the
basic outcomes C_1 or C_n. Intuitively, this axiom is reasonable if
only the basic outcomes are relevant to the decision maker; the
manner in which a basic outcome is attained has no bearing on the
decision maker’s preference. In Chap. 2 we defined the basic
outcomes so that they incorporated all the relevant aspects of interest
to the individual. If this can always be done, then this axiom is
plausible. If, however, the strategy itself carries some degree of
satisfaction not apparent in the outcome, then the decision
situation will violate this axiom. In other words, this axiom applies to
situations where it does not matter how you play the game; it is
what you win that counts.

A numerical example may make the replacement process clear.
Let S = [1/4 ($10), 1/2 ($40), 1/4 ($80)]. Suppose that the component
strategy promising $10 with certainty is equivalent to the basic
mixed strategy [0.3 ($100), 0.7 ($1)], $40 is equivalent to the basic
mixed strategy [0.7 ($100), 0.3 ($1)], and $80 is equivalent to
[0.9 ($100), 0.1 ($1)]. Substituting these equivalent strategies in S
yields:

    {1/4 [0.3 ($100), 0.7 ($1)], 1/2 [0.7 ($100), 0.3 ($1)], 1/4 [0.9 ($100), 0.1 ($1)]}

We have replaced each basic outcome ($10, $40, and $80) with a
mixed strategy that promises a return of either $100 or $1. The
resulting reduced strategy therefore involves, after a two-step
derivation, only the two basic outcomes $100 and $1. The compound
strategy results in either the best or worst outcome after two steps:
first, a chance mechanism determines whether the first, second, or
third equivalent reduced strategy is chosen; second, another
chance mechanism determines whether the best or worst outcome
results. The first equivalent reduced strategy will be chosen with
probability 1/4, the second with probability 1/2, and the third with
probability 1/4. The following diagram illustrates this compound
strategy.



[Diagram. First step: spin the pointer; if it stops in region 1,
choose S_1; if in 2, S_2; if in 3, S_3. Second step: spin the pointer
of the appropriate wheel to determine whether $100 or $1 results.]


That either the best or worst outcome will result after two steps
suggests that any compound strategy can be reduced to a basic
mixed strategy that is a lottery involving only the best and worst
outcomes. For instance, note in the diagram above that if the
compound strategy is chosen a large number of times, in the first
step, wheel S_1 will be chosen about 1/4 of the time; and in these
cases, $100 will result about 30 per cent of the time; that is, $100
will result 1/4 (30 per cent) of the time, or 7 1/2 per cent of the time.
In the first step, wheel S_2 will be chosen 1/2 of the time; and in these
cases, $100 will result 70 per cent of the time; that is, $100 will
result 1/2 (70 per cent) of the time, or 35 per cent of the time. Also
in the first step, wheel S_3 will be chosen 1/4 of the time; and in
these cases $100 will result 90 per cent of the time; that is, $100
will result 1/4 (90 per cent) of the time, or 22 1/2 per cent of the time.
Combining all the possible ways in which $100 will be the final
result, we see that after a large number of plays, $100 will be the
final basic outcome 7 1/2 + 35 + 22 1/2, or 65 per cent of the time.
Since we are assuming that it is only the final outcomes that
concern the individual, these substitutions and reductions indicate
that the original compound strategy is equivalent to the reduced
strategy [0.65 ($100), 0.35 ($1)].

In summary, we started with an arbitrary strategy S, converted 
it into a compound strategy which in turn was reduced to a basic 
mixed strategy involving only the best and worst outcomes. S, the 
compound strategy, and the resulting simple strategy were all 
equivalent. 

In general, if (p_1 C_1, ..., p_n C_n) is any strategy, it will be
equivalent to a basic mixed strategy [q C_1, (1 - q) C_n], where
q = p_1 u_1 + p_2 u_2 + ... + p_n u_n, and u_i is the utility of C_i.
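
The reduction can be checked against the numerical example. The sketch below takes the probabilities in S to be 1/4, 1/2, and 1/4, which is what the 7 1/2, 35, and 22 1/2 per cent figures imply, and uses the utilities 0.3, 0.7, and 0.9 given for $10, $40, and $80; the names UTILITY and reduce_to_basic are illustrative.

```python
from fractions import Fraction

# Utility of each dollar outcome, from the numerical example.
UTILITY = {10: Fraction(3, 10), 40: Fraction(7, 10), 80: Fraction(9, 10)}

# S as a list of (probability, outcome) pairs.
S = [(Fraction(1, 4), 10), (Fraction(1, 2), 40), (Fraction(1, 4), 80)]


def reduce_to_basic(strategy, utility):
    """Probability q of the best outcome in the equivalent basic mixed strategy."""
    return sum(p * utility[outcome] for p, outcome in strategy)


q = reduce_to_basic(S, UTILITY)
print(q, 1 - q)  # 13/20 7/20
```

Here 13/20 = 0.65 and 7/20 = 0.35, in agreement with the two-step argument.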

AXIOM 4. If any two strategies involve only the best and worst
outcomes, that strategy which places the highest probability on the best
outcome will be preferred. For example, if S = [0.8 ($100), 0.2 ($1)]
and S' = [0.7 ($100), 0.3 ($1)], then a rational individual would prefer
S over S'.

These four axioms allow our rational man to compare two
complex strategies by reducing them to basic mixed strategies of
the form of Axiom 3 and then choosing the strategy that makes C_1
more probable. When a complex strategy is reduced to a basic
mixed strategy involving only C_1 and C_n, the probability associated
with C_1 is

    p = p_1 u_1 + p_2 u_2 + ... + p_n u_n,

where p_i is the probability of C_i in the complex strategy and u_i is
the utility of C_i. In a statistical sense, p is the “average” utility
of the complex strategy; if this strategy is followed repeatedly, u_i
will occur with a frequency indicated by the probability p_i, and
the average utility experienced will be p. The larger p, the more
preferred the strategy. Note the resemblance between the
expression defining p and the expression for expected value discussed
earlier. If we replace each u_i by C_i, we have the mathematical
expectation, or expected value of the strategy. We noted that this
procedure did not permit us to rank strategies. The four axioms
have allowed us to replace each C_i (an objective measure) with
a u_i (a subjective measure that will vary from individual to
individual); the strategies are then ranked on the basis of expected
utility, rather than expected value.

To summarize results so far: if an individual has consistent
preferences among all possible strategies in a given situation, then
he can associate with each basic outcome a number we call its
utility, so that if he prefers C_i to C_j then u_i > u_j. Furthermore,
he will prefer a strategy S_i over S_j if the expected utility of S_i is
greater than the expected utility of S_j.
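
The difference between the two criteria can be illustrated with the utilities of the earlier numerical example (0.3, 0.7, and 0.9 for $10, $40, and $80). The sketch compares a gamble with probabilities 1/4, 1/2, and 1/4 on those outcomes against $40 for certain; the comparison itself is constructed here for illustration and does not appear in the text.

```python
from fractions import Fraction

UTILITY = {10: Fraction(3, 10), 40: Fraction(7, 10), 80: Fraction(9, 10)}

GAMBLE = [(Fraction(1, 4), 10), (Fraction(1, 2), 40), (Fraction(1, 4), 80)]
SURE_40 = [(Fraction(1), 40)]


def expected_value(strategy):
    """Probability-weighted average of the dollar outcomes."""
    return sum(p * dollars for p, dollars in strategy)


def expected_utility(strategy):
    """Probability-weighted average of the utilities of the outcomes."""
    return sum(p * UTILITY[dollars] for p, dollars in strategy)


# Expected value favors the gamble: 85/2 dollars against 40.
print(expected_value(GAMBLE) > expected_value(SURE_40))      # True
# Expected utility favors the sure $40: 7/10 against 13/20.
print(expected_utility(SURE_40) > expected_utility(GAMBLE))  # True
```

An individual with these utilities would decline the gamble even though its expected value is higher.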


cardinal versus ordinal utility 

The assignment of utilities to basic outcomes is called
the individual’s utility function. The utility function places a relative
value on each basic outcome that expresses the individual’s
subjective attitude toward risk. This utility function, like the utility
function for basic outcomes under complete certainty derived in
Chap. 2, has the property that the more desirable the basic
outcome, the higher the utility associated with it. But unlike the
ordinal utility function for riskless situations, the utility function
derived in this chapter from consistent preferences among
strategies is more restricted in its nature. Recall that an ordinal utility
function, when transformed so that the new values maintain the
same order, will still express the same preference pattern as the
untransformed utility function. Thus, an ordinal utility function
that assigns utilities 0, 0.1, 0.3, and 0.4 to C_1, C_2, C_3, and C_4
expresses exactly the same preferences, and will lead to exactly
the same selection of strategy in any given situation, as a utility
function which assigns utilities of 1, 3, 9, and 18 to these basic
outcomes.¹

The utility function derived in this chapter does not have this
ordinal property: the relative magnitudes of the utilities have
relevance. In particular, only if each utility value is changed in a
specific way will the resulting transformed utility function express
identical preferences: if each utility value is multiplied by some
constant, k > 0, to which is added some constant, a, then the
resulting utility function will lead to the same preference ranking
among mixed strategies. That is, if u' = a + ku, then u' defines
the same preference structure as u.
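
A small sketch shows why only transformations of the form a + ku are safe. It reuses the utilities 0.3, 0.7, and 0.9 from the numerical example and compares a gamble against a sure $40. The particular constants a = 2, k = 3 and the fourth-power transformation are arbitrary choices made here for illustration.

```python
from fractions import Fraction

UTILITY = {10: Fraction(3, 10), 40: Fraction(7, 10), 80: Fraction(9, 10)}

GAMBLE = [(Fraction(1, 4), 10), (Fraction(1, 2), 40), (Fraction(1, 4), 80)]
SURE_40 = [(Fraction(1), 40)]


def expected_utility(strategy, transform=lambda u: u):
    """Expected value of transform(utility) over (probability, outcome) pairs."""
    return sum(p * transform(UTILITY[c]) for p, c in strategy)


def prefers_sure_thing(transform=lambda u: u):
    """True when the sure $40 has the higher (transformed) expected utility."""
    return expected_utility(SURE_40, transform) > expected_utility(GAMBLE, transform)


def linear(u):
    return 2 + 3 * u   # u' = a + ku with a = 2, k = 3


def fourth_power(u):
    return u ** 4      # order-preserving on [0, 1], but not of the form a + ku


print(prefers_sure_thing())              # True
print(prefers_sure_thing(linear))        # True
print(prefers_sure_thing(fourth_power))  # False
```

The fourth power keeps the ordering of the basic outcomes, yet it reverses the ranking of the two strategies; the linear transformation does not.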

If we adopt the convention that u(C_1) = 1 and u(C_n) = 0, that
is, we assign utility of 1 to the best outcome and 0 to the worst,
then the utility function derived here is unique. This utility
function, where the relative magnitudes are meaningful, is called a
cardinal utility function. (The word “ordinal” is used to indicate
that only the order property of the real numbers is used; “cardinal”
is used to indicate not only the order property, but also the
property of magnitude.)

In Chap. 2 we showed that an individual with consistent
preferences, under complete certainty, made decisions as if he were
maximizing an ordinal utility function. In this chapter we have
shown that a rational individual (one obeying our axioms), when
faced with strategies involving risk (strategies involving objective
probabilities), behaves as if he were maximizing a cardinal utility
function. It is important to note that from consistent preferences
we deduced utility functions, not vice versa. The utility function
is merely a compact description of the individual’s preferences.
Because we have shown that utility functions exist for rational
individuals under certain circumstances, we are able to use
analytical methods for choosing the optimal strategy from a large set
of available strategies without having to compare all strategies.
This is the real purpose of the preceding construction. Otherwise
we could be criticized for having built a complex mechanism
merely to tell a decision maker that he should choose that strategy
which he prefers most, which would be gratuitous advice at best.

¹ This property can be expressed mathematically: if u is an ordinal utility
function, then f(u) is also a utility function leading to exactly the
same decisions provided that f' > 0.


resolving the Petersburg paradox 

Historically, the notion of utility preceded the
axiomatic construction presented here. Utility was interpreted as a
measure of “satisfaction.” In this way, Bernoulli was able to
explain the Petersburg game. His hypothesis states that individuals
derive greater and greater satisfaction from increasing wealth, but
the rate of increase of satisfaction declines in proportion to the
individual’s wealth: a millionaire receives much less satisfaction
(utility) from an additional dollar than does a pauper. The
hypothesis is that individuals maximize satisfaction, not wealth.

It must have seemed natural to Bernoulli, at a time when many
laws of the universe were discovered and expressed in
mathematical form, that man’s satisfaction from wealth should also be
governed by some law. If there were any justice in the world, this
law might be expressed with some elegant formula. One
mathematical function which had the required properties was the
logarithm of wealth. This was seized upon by Bernoulli. First, it
accorded higher utilities to higher wealth; its first derivative was
positive. Second, the rate of increase in satisfaction decreased with
increasing wealth; the second derivative was negative. (See Fig. 3-1.)

Now an individual offered the chance to play the Petersburg
game would reach his decision by calculating his expected utility

    (1/2) log 2 + (1/4) log 4 + (1/8) log 8 + ... + (1/2^n) log 2^n + ...

minus the utility of the entrance fee. This infinite series has a
finite sum and so might lead to a decision to play. But
unfortunately, this does not resolve the paradox. For now we can
construct a game whose expected utility is infinite and again would
lead an individual to play no matter how expensive the entrance
fee. Suppose that if the game ends on the first toss, you win $e^{2},
on the second toss $e^{2^2}, on the nth toss, $e^{2^n}. Using Bernoulli’s
utility function,² the utility of $e^{2} = 2, of $e^{2^2} = 4, of $e^{2^n} = 2^n, so
that the expected utility of the game is again infinite:
2/2 + 4/4 + 8/8 + ... = 1 + 1 + 1 + ....
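
Both series can be examined numerically. The sketch below assumes natural logarithms, as the text does, and truncates each sum at a chosen number of tosses; the function names are illustrative.

```python
import math


def expected_log_utility(max_tosses):
    """Partial sum of (1/2^n) * log(2^n), using log(2^n) = n * log 2."""
    return sum(n * math.log(2) / 2**n for n in range(1, max_tosses + 1))


def expected_log_utility_inflated(max_tosses):
    """Same sum when the nth payoff is e^(2^n), whose log utility is 2^n."""
    return sum(2**n / 2**n for n in range(1, max_tosses + 1))


print(round(expected_log_utility(60), 6))  # 1.386294
print(expected_log_utility_inflated(60))   # 60.0
```

The first sum settles at 2 log 2, about 1.386, while the second grows by 1 with every additional toss, so the logarithm alone does not remove the paradox.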

A payoff scheme can always be devised which will lead to this
paradox if the utility function is unbounded. Let u(w) be an
unbounded utility function of wealth. Now consider the inverse of u
(see Fig. 3-2). The inverse of u, written u^{-1}, is defined for every
utility level, so that we can define the payoff at the nth toss to be
u^{-1}(2^n). Then the utility of this payoff is u[u^{-1}(2^n)] = 2^n,
leading to the troublesome infinite sum.

² We are considering natural logarithms with base e, so that log (e^x)
= x.

One escape from the paradox is to assume that the utility is
always increasing but bounded above. Suppose we assume that
utility never exceeds 1 no matter how large wealth becomes. This
immediately suggests the utility function u = 1 - 1/w. (See Fig.
3-3.) This gives rise to the sum

\[ \frac{1}{2}\cdot\frac{1}{2} + \frac{1}{4}\cdot\frac{3}{4} + \frac{1}{8}\cdot\frac{7}{8} + \cdots + \frac{1}{2^n}\cdot\frac{2^n - 1}{2^n} + \cdots \]

It can be shown that this infinite series sums to 2/3. The expected
utility of the Petersburg game is 2/3. This is the utility of $3.
Therefore an individual with this utility function would not play if the
entrance fee exceeded $3. This seems to characterize an extremely
cautious individual with high aversion to gambling. A less cautious
individual might have a utility function of the form

\[ u(w) = 1 - \frac{1}{w^{z}}, \qquad 0 < z < 1. \]

The smaller z, the more an individual would pay.
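The sums discussed above are easy to check numerically. The following is a minimal sketch, assuming the usual Petersburg payoff of 2^n dollars with probability 1/2^n and ignoring initial wealth; the function names and the truncation at 200 terms are illustrative choices, not part of the text.

```python
import math

def expected_utility(utility, n_terms=200):
    """Expected utility of the Petersburg game: payoff 2**n with probability 2**-n."""
    return sum(utility(2.0 ** n) / 2.0 ** n for n in range(1, n_terms + 1))

def bounded_utility(z):
    """Return the bounded utility function u(w) = 1 - 1/w**z."""
    return lambda w: 1.0 - w ** (-z)

def certainty_equivalent(z, n_terms=200):
    """Sure wealth whose utility equals the game's expected utility under u(w) = 1 - 1/w**z."""
    eu = expected_utility(bounded_utility(z), n_terms)
    # Solve 1 - w**(-z) = eu for w.
    return (1.0 - eu) ** (-1.0 / z)

# Logarithmic (Bernoulli) utility: the series converges to 2 log 2 = log 4.
print("log utility:", expected_utility(math.log))

# Bounded utilities: a smaller z gives a larger certainty equivalent.
for z in (1.0, 0.5, 0.1):
    print(z, expected_utility(bounded_utility(z)), certainty_equivalent(z))
```

With z = 1 the certainty equivalent is the $3 quoted in the text, and it rises as z falls, which is the sense in which a smaller z describes a less cautious individual.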

But there is no need to search for “the” utility function of 
wealth. It is recognized now that risk attitudes differ substantially 
between individuals. One man’s risk is another man’s insurance 
against boredom. 

PROBLEMS 

1. A mail order firm ran an advertisement for 100 weeks for a $10
item with the following results:

   NO. OF WEEKS    NO. OF ORDERS
         5              100
        10              120
        20              180
        40              200
        20              210
         5              280

If this pattern persists, what are the expected sales for any week?

2. Suppose in Problem 1 that the firm advertises 3 different products 
with prices of $20, $10, and $5. Most of the orders are for the $10 
item, but 25 per cent are for the $5 item and 20 per cent for the $20 
item. What are the firm’s expected sales for any week? 

3. You have $20,000. You are offered a ship and cargo for $10,000. 
If the ship arrives safely in the Orient you can sell its cargo for 
$12,000. If the ship then returns safely to the United States you 
can sell it for $8,000. 

a. If the probability of a safe voyage out or back is what is 
the expected profit of this investment? 

b. What probability would make it a “fair” bet or zero profit? 

c. If your utility for wealth is the (common) logarithm of wealth,
what probability would leave you indifferent toward this
investment?

d. Calculate your own utility for each possible outcome by
comparing the choice between the outcome with certainty, and
some lottery involving the best and worst outcome. What
probability for success of voyage would leave you indifferent
toward this investment?

When we leave the casino or the laboratory, objective probabilities
seem to disappear. Consider the statement: the probability that
Jones will be elected president is 0.4. We cannot repeat the election
many times to see if the number of times Jones is elected
approaches the fraction 4/10. It could be argued that if it were possible
to hold a large number of elections under the conditions that now
prevail, Jones would be elected about 40 per cent of the time. The
fact that in practice we cannot carry out this experiment does not
invalidate the statement. Perhaps, but this statement is still
qualitatively different from the statements involving objective
probabilities. The essential difference is that two reasonable men
weighing exactly the same information may arrive at very different
probabilities. In this chapter, we will look at situations where
objective probabilities do not exist but where an individual still
has clear-cut preferences among a set of strategies.


subjective probability

The most prevalent (and interesting) decision situations
do not involve certainty or objective probabilities. Political
strategy is based on the likelihood of war and peace, and throughout
history different policy makers have held very different estimates.
In the stock market an individual who believes the likelihood of
falling prices to be greater than the likelihood of rising prices
makes a decision to sell a security to an individual whose
expectations are more sanguine. Bankers and businessmen often view
quite differently the chances of success for an enterprise; bettors
at a horse race often have different “favorites.”

In this chapter we show that if an individual has clear-cut
consistent preferences over a specified set of strategies he will act
as if he assigned probabilities to various outcomes. These
probabilities, called subjective probabilities, have all the mathematical
properties of objective probabilities except that they are unique
to the individual. Just as the utility function derived in Chap. 3
summarizes the individual’s risk attitude by placing a relative
value on the outcomes, these subjective probabilities
summarize the individual’s intensity of belief in the possibility of
various outcomes. Again we introduce some axioms; if these
axioms are appropriate in a given situation we show that a
rational individual will make decisions by maximizing expected
utility.



DECISION MAKING WITH SUBJECTIVE PROBABILITIES 


the existence of subjective probabilities 

Consider an individual who must choose between S_1,
campaign for candidate Jones and give up present employment,
and S_2, do not campaign and continue in present job. If he chooses
S_1 and Jones is elected, he will be appointed to a position paying
$20,000 a year; if Jones is not elected he will have to accept a
position paying $10,000 a year. If he chooses S_2 he will continue
to make $15,000 a year whether Jones is elected or not. His
decision hinges on his utility for income and his estimate of Jones’s
chance for success. This decision situation can be summarized in
the following matrix:


MATRIX 4.1: Decision Situation Without Objective Probabilities.

                             W_1 = Jones elected    W_2 = Jones defeated

S_1 = campaign for Jones     Make $20,000 a year    Make $10,000 a year

S_2 = do not campaign        Make $15,000 a year    Make $15,000 a year


W_1 and W_2 are the “states of the world” that will be the result
after the votes are counted. If objective probabilities can be
attached to these states, this individual would then choose the
strategy that maximizes his expected utility for income. However, if it is a
typical election beset with many uncertainties (a scandal may taint
one of the candidates before election; some event such as war,
outside the influence of either candidate, may change the appeal
of one candidate to the voters; a slip of the tongue in a campaign
speech may alter a candidate’s prospects dramatically; a close
associate of one of the candidates may suddenly be uncovered as a
felon) then reasonable men faced with the same evidence may
differ widely about Jones’s prospects. Nevertheless, we will show
that if the individual now choosing between S_1 and S_2 has distinct
and consistent preferences over a relevant set of alternatives, then
his choice will be made as if he assigned a numerical quantity to
W_1 (we call this quantity his subjective probability that Jones will
be elected), and he will operate with this probability to maximize
his utility in exactly the same fashion as the individual in Chap. 3
who operated with objective probabilities.

Let us generalize this situation by introducing s possible “states
of the world,” W_1, W_2, ..., W_s. Again we assume that there are
n basic outcomes, C_1, C_2, ..., C_n, so that he prefers C_1 most and
C_n least. Then a decision matrix can be constructed:

MATRIX 4.2: A Generalized Decision Situation Without Objective Probabilities
Involving Basic Outcomes.

           W_1     W_2     ...    W_s

S_1        C_11    C_12    ...    C_1s
S_2        C_21    C_22    ...    C_2s

Here each entry C_ij is one of the basic outcomes. Denote by E_i the
strategy (D_i1, D_i2, ..., D_is), where D_ij is a strategy involving
objective probabilities

\[ D_{ij} = (p_1 C_1,\; p_2 C_2,\; \ldots,\; p_n C_n) \tag{1} \]

Then we can generalize Matrix 4.2 even further:

MATRIX 4.3: A Generalized Decision Situation Without Objective Probabilities
Involving Risky Outcomes.

           W_1     W_2     ...    W_s

E_1        D_11    D_12    ...    D_1s
E_2        D_21    D_22    ...    D_2s

For example, if E_1 is chosen in Matrix 4.3, then the outcome will
be D_11 if W_1 is the state of the world, D_12 is the outcome if W_2 is
the state of the world, etc. To say that D_11 is the outcome means
that this strategy involving objective probabilities will be the
result; so that now a spin of some wheel will determine which basic
outcome, C_k, will result. For example, if D_11 = (p_1 C_1, p_n C_n),
then a basic outcome is reached in two steps: first, a “true” state
of the world is observed (in this case W_1) and second, outcome C_1
will result with objective probability p_1 or C_n will result with
probability p_n. (D_ij may of course be a pure strategy that results in
some C_k with certainty.)

Now, one last generalization; consider the set of all strategies
of the form

\[ (p_1 E_1,\; p_2 E_2,\; \ldots,\; p_s E_s) \tag{2} \]

where again the p_i’s represent objective probabilities. Strategies of
this form culminate in a basic outcome after three steps:

Step 1: Spin a wheel to select E_i = [D_i1, D_i2, ..., D_is]. The
determination of E_i is subject to the objective
probabilities of Expression (2).

Step 2: If “true” state of world is W_j then D_ij results.

Step 3: Spin a wheel to select C_k. This determination is
subject to the objective probabilities of Expression (1).
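The three steps above can be sketched as a small simulation. This is an illustrative sketch only: the particular lotteries, the outcome labels "C1" to "C3", and the helper names `spin` and `play` are hypothetical and do not come from the text.

```python
import random

def spin(lottery, rng):
    """Spin a wheel: lottery is a list of (probability, item) pairs whose probabilities sum to 1."""
    r = rng.random()
    cumulative = 0.0
    for prob, item in lottery:
        cumulative += prob
        if r < cumulative:
            return item
    return lottery[-1][1]  # guard against floating-point rounding

def play(mixed_strategy, true_state, rng):
    """Resolve a strategy of the form of Expression (2) into a basic outcome."""
    e_i = spin(mixed_strategy, rng)   # Step 1: wheel selects E_i = [D_i1, ..., D_is]
    d_ij = e_i[true_state]            # Step 2: the true state W_j picks out D_ij
    return spin(d_ij, rng)            # Step 3: wheel selects a basic outcome C_k

# Hypothetical example with two states of the world (index 0 = W_1, index 1 = W_2).
d_11 = [(0.5, "C1"), (0.5, "C3")]     # a risky D_ij
sure_c2 = [(1.0, "C2")]               # a pure strategy: C2 with certainty
e_1 = [d_11, sure_c2]
e_2 = [sure_c2, sure_c2]
mixed = [(0.25, e_1), (0.75, e_2)]

rng = random.Random(0)
print([play(mixed, 0, rng) for _ in range(10)])
```

The state of the world is passed in as an argument rather than drawn from a wheel, which mirrors the point of the chapter: no objective probability is attached to it.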

Let us call the set of all strategies of the form as in Matrix 4.3
A*, and the set of all strategies of the form of Expression (2) R*.
We can now exploit the axiomatic structure of Chap. 3 if we
identify A* with the set of basic outcomes, A = {C_1, C_2, ..., C_n},
and R* with the set of all possible strategies involving objective
probabilities and basic outcomes, i.e., with the set R whose
elements appear as (p_1 C_1, p_2 C_2, ..., p_n C_n). These constructs are set
forth in the following table.

OBJECTIVE PROBABILITY

A = {C_1, C_2, ..., C_n} = set of basic outcomes.

R = {(p_1 C_1, p_2 C_2, ..., p_n C_n)} = set of all possible strategies
using objective probabilities.

u = utility function so that u(C_i) = u_i and
u(p_1 C_1, ..., p_n C_n) = p_1 u_1 + p_2 u_2 + ... + p_n u_n.

SUBJECTIVE PROBABILITY

{W_1, W_2, ..., W_s} = possible states of world.

A* = {[D_1, D_2, ..., D_s]} = set of all possible strategies where
D_i results if true state of world is W_i. Each D_i is an element of R.

R* = {(p_1 E_1, ..., p_s E_s)} = set of all strategies involving
objective probabilities. Each E_i is an element of A*.

u*(W_i) = p_i* = subjective probability of state of world W_i.

Now if the axioms of Chap. 3 apply for R*, that is, if an
individual has clear-cut consistent preferences among the elements
of R*, then by precisely the same reasoning as in Chap. 3, there exists
a utility function over R* which we call u*. (It might be argued
that the strategies in R* are too complex for an individual to
compare. This is probably true when the basic outcomes are diverse
in nature. However, when the basic outcomes involve different
sums of money, these strategies are not far removed from the
Irish Sweepstakes where a lottery determines the winners of horse
race tickets, which in turn determine money prizes to winners.)

For example, consider the decision situation of Matrix 4.1. The
set of basic outcomes consists of C_1 = make $20,000 a year,
C_2 = make $15,000 a year, and C_3 = make $10,000 a year. There
are two possible states of the world, W_1 = Jones elected, and
W_2 = Jones defeated. An element of R has the form (p_1 $20,000,
p_2 $15,000, p_3 $10,000); this is a strategy with objective
probabilities that results in $20,000 with probability p_1, $15,000 with
probability p_2, and $10,000 with probability p_3. Call this strategy D_1.
Another element of R with probabilities p_1', p_2', and p_3' call D_2.
Now [D_1, D_2] is an element of A*; it is the strategy that results in
D_1 if the true state of the world is W_1, otherwise it results in D_2.
Call this element of A*, E_1. Another element of A* may be
denoted by [D_3, D_4] = E_2. Now an element of R* has the form
(p_1'' E_1, p_2'' E_2) = F_1. Now suppose the probabilities have the
following values:
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Py = 

1 

4 

Pi = i 

pi' = 

1 

8 

P2 = 

1 

2 

P2 = § 

P2 = 

3 

4 

Pz = 

1 

4 

Ps = i 

pi' = 

1 

S 


Then fi is the compound strategy that looks like this 

[i (A, At), | (A s , A), I (Di, AO] = 

[| (§• $20,000, §-$15,000, §-$10,000), 

(§- $20,000, §• $15,000, § • $10,000)], f (. . .) 


This is a very cumbersome expression, but if it does not matter
to the individual whether a wheel is first spun and then the state
of the world determined or vice versa (and this is implicit in our
assumption earlier that the individual is interested only in the
basic outcome and not the strategy), then the expression F_1 can
be reduced to the form

(D_1', D_2')

This last strategy results in a compound strategy D_1' if the true
state of the world is W_1, and D_2' if otherwise. In this way, elements
of R* are not as complex as they first seem; there are probably
many situations in which an individual can express clear
preferences among such strategies.

Now consider those elements in R* where some unique E_i
always results from Step 1. We can then identify E_i with this
element; call the utility associated with this element of R*, u*(E_i).
Again, we consider a special subset of the E_i, those of the form
E_i = [C_i, C_j, ..., C_k], i.e., strategies as depicted in Matrix 4.2. Let
u*([C_1, C_1, ..., C_1]) = 1 and u*([C_n, C_n, ..., C_n]) = 0, that is,
the utility of the most preferred strategy (that which will result
with certainty in the most preferred basic outcome) equals 1, and
the utility of the least preferred equals 0. Then let

\[
\begin{aligned}
u^*([C_1, C_n, C_n, \ldots, C_n]) &= u^*(W_1) = p_1^* \\
u^*([C_n, C_1, C_n, \ldots, C_n]) &= u^*(W_2) = p_2^* \\
&\;\;\vdots \\
u^*([C_n, \ldots, C_1, \ldots, C_n]) &= u^*(W_i) = p_i^*
\end{aligned}
\]

Then it can be shown (see Reference 1) that the p_i* sum to 1
and that

\[ u^*(D_1, D_2, \ldots, D_s) = p_1^* u(D_1) + \cdots + p_s^* u(D_s) \tag{3} \]

The number p_i* is called the subjective probability of the ith state
of the world. (The numbers u(D_i) are the utilities associated with
the strategies involving objective probabilities.)

Note that in Expression (3) the numbers p_i* are used in exactly
the same way that objective probabilities were used in Chap. 3.
This individual’s preferences gave rise to the subjective
probabilities of the various states of the world and the utility associated
with each basic outcome, and he ranked preferences by
maximizing his subjective expected utility.
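Expression (3) can be applied directly to the campaign decision of Matrix 4.1. The sketch below is illustrative only: the logarithmic utility for income and the particular subjective probabilities passed in are assumptions chosen for the example, and the names `STRATEGIES` and `best_strategy` are hypothetical.

```python
import math

# Incomes (dollars a year) from Matrix 4.1, listed as [Jones elected, Jones defeated].
STRATEGIES = {
    "S1": [20000, 10000],  # campaign for Jones
    "S2": [15000, 15000],  # do not campaign
}

def subjective_expected_utility(incomes, beliefs, utility):
    """Weight the utility of each state's result by the subjective probability of that state."""
    return sum(p * utility(x) for p, x in zip(beliefs, incomes))

def best_strategy(p_elected, utility=math.log):
    """Strategy with the largest subjective expected utility (log utility is an assumption)."""
    beliefs = [p_elected, 1.0 - p_elected]
    return max(
        STRATEGIES,
        key=lambda s: subjective_expected_utility(STRATEGIES[s], beliefs, utility),
    )

print(best_strategy(0.4))                         # a pessimist about Jones
print(best_strategy(0.55))                        # mild optimist, log utility
print(best_strategy(0.55, utility=lambda x: x))   # same belief, linear utility
```

Two individuals with the same belief about the election can still choose differently if their utility functions differ, and two with the same utility function can choose differently if their subjective probabilities differ.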


summary 

Essentially, the axioms of utility theory set out in Chap.
3 have been used again to derive subjective probabilities. A new
set of strategies, R*, was defined. These strategies were compound
in nature and involved objective probabilities and the resultant
true state of the world. In situations in which an individual can
express distinct and consistent preferences over this set, we saw
that his preferences could be summarized by a “utility” function,
u*, on this set. By normalizing this function, that is, setting u* of
the most preferred element equal to 1 and u* of the least preferred
equal to 0, it was shown that u* of the element of R* that returns
the most preferred outcome C_1 if state of the world W_i results and
C_n otherwise can be interpreted as the subjective probability that
the true state of the world is W_i. These probabilities have all the
mathematical characteristics of objective probabilities; the
individual’s preferences then dictate that he maximizes subjective
expected utility when choosing a strategy from those available to him.

It might be worth emphasizing that the demands on the decision
maker are more severe when working with subjective probabilities
than when working under certainty or with objective probabilities.

PROBLEM


1. A businessman must choose from S_1, S_2, and S_3 in the following
matrix. (S_4 and S_5 are not available to him.)


S_1

S_2

S_3

S_4

S_5


The entries in the matrix represent his profit or loss. The states of
the world concern our relations with Russia for the year 1972. S_1 is
the strategy to build a factory in Germany; S_2 is the strategy to
build a munitions plant in U.S.; S_3 is the strategy to build a bakery
in U.S. The businessman’s utility for money is linear and he wishes
to maximize his expected profit. He claims he would be indifferent 
between *S 4 (if it were available to him) and a simple lottery that 
returned a loss of 50 with probability \ and a gain of 100 with 
probability f. He is also indifferent between S$ (if it were available 
to him) and a lottery that returned a loss of 50 with probability f 
and a gain of 100 with probability f. 

a. What is the businessman’s subjective probability that war will 
occur; that the cold war will continue; and that peace will ensue? 

b. Which strategy will he select? 

c. Determine your subjective probabilities for W_1, W_2, and W_3.
Then determine your utility for the outcomes -50, 20, and
100. Which strategy will you select from S_1, S_2, and S_3?


Peace War Cold War 
AN APPLICATION: PORTFOLIO SELECTION


Maximizing utility with subjective probabilities requires delicate
estimates; first the individual must know his risk attitude (his
utility function for the basic outcomes), and second, he must have
a finely graded estimate of the likelihood of future events. When
the basic outcomes are money income (or gain) it seems
reasonable that an individual will not only be able to rank the outcomes,
but also to rank various lotteries involving these outcomes. If this
is so, then his utility function can be determined. When the
possible states of the world are also measured on one axis, such as
the possible price of a stock on some date in the future, then it is
conceivable that an individual can express fine judgments about
the probabilities of these states.

One area of human behavior in which these conditions seem
to exist is the selection of a portfolio of securities. Some
participants in the stock market seem able to express definite opinions
about the future course of a security. In this chapter let us assume
that they can make comparisons between elements of R* so that
they have a distinct utility function and a distinct probability
distribution over the possible states of the world. How will such
an individual select a portfolio of securities? We examine this
with some primitive examples.


selecting a portfolio from a universe of two securities

An investor whose utility function for gain is log (x + 1)
(where x is amount of gain) is faced with a universe consisting of
only Security A and Security B. Assume he wishes to invest a fixed
sum of money by purchasing some combination of A and B; in
other words, he wishes to assemble a portfolio. The following
table shows his subjective probability for the outcome of each
security and the resultant expected utility (the figures correspond
to common logarithms, base 10):

               PROFIT
               OUTCOME    PROBABILITY    EXPECTED UTILITY

SECURITY A        0           3/4
                +30           1/4        1/4 log (31) = 0.3729

SECURITY B        0           1/2
                 +5           1/2        1/2 log (6) = 0.3891

If he were forced to make a choice between these two securities,
he would choose Security B. Ordinarily he is not so constrained;
he may invest a fraction of his capital in each security, or perhaps
just one security, leaving the remainder in cash. In this instance,
since he does not believe there is any possibility of loss, he will
not wish to hold cash, but he must decide how to apportion his
capital. Let p be the proportion of his capital invested in Security
A, so that 1 - p is the proportion invested in Security B. If it is
further assumed that the outcomes are independent, then a
probability distribution of a portfolio can be tabulated:


PROFIT OUTCOME                      PROBABILITY

p(0) + (1 - p)(0) = 0                   3/8
p(0) + (1 - p)(5) = 5 - 5p              3/8
p(30) + (1 - p)(0) = 30p                1/8
p(30) + (1 - p)(5) = 25p + 5            1/8

EXPECTED UTILITY OF PORTFOLIO:

\[
E(U) = \tfrac{3}{8}\,U(0) + \tfrac{3}{8}\,U(5 - 5p) + \tfrac{1}{8}\,U(30p) + \tfrac{1}{8}\,U(25p + 5)
     = \tfrac{3}{8}\log(6 - 5p) + \tfrac{1}{8}\log(30p + 1) + \tfrac{1}{8}\log(25p + 6)
\]

In order to maximize expected utility, set dE(U)/dp = 0. Apart
from a constant factor that depends only on the base of the
logarithm, this condition is

\[
-\frac{15}{8(6 - 5p)} + \frac{30}{8(30p + 1)} + \frac{25}{8(25p + 6)} = 0 .
\]

Clearing denominators and simplifying gives -3750p^2 + 980p + 228 = 0,
so that p is approximately 0.4. This investor
would thus invest 40 per cent of his capital in Security A and 60
per cent in Security B and his probability distribution and expected
utility would be:

PROFIT OUTCOME    PROBABILITY

      0               3/8
      3               3/8
     12               1/8
     15               1/8

EXPECTED UTILITY OF PORTFOLIO:
3/8 log 4 + 1/8 log 13 + 1/8 log 16 = 0.5155
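The two-security calculation can be reproduced numerically. The sketch below is illustrative: it assumes common (base-10) logarithms, which is what the tabulated values 0.3729, 0.3891, and 0.5155 correspond to, and the grid search and variable names are choices made here rather than the author's method.

```python
import math

# (probability, profit of Security A, profit of Security B), outcomes assumed independent.
SCENARIOS = [
    (3 / 8, 0, 0),
    (3 / 8, 0, 5),
    (1 / 8, 30, 0),
    (1 / 8, 30, 5),
]

def expected_utility(p):
    """E(U) when fraction p is in Security A and 1 - p in Security B; U(x) = log10(x + 1)."""
    return sum(
        prob * math.log10(p * a + (1 - p) * b + 1)
        for prob, a, b in SCENARIOS
    )

# Positive root of the first-order condition -3750 p^2 + 980 p + 228 = 0.
p_star = (980 + math.sqrt(980 ** 2 + 4 * 3750 * 228)) / (2 * 3750)

# Independent check: brute-force search over a grid of portfolio weights.
grid_best = max((i / 1000 for i in range(1001)), key=expected_utility)

print("root of quadratic:", p_star)
print("grid maximum:     ", grid_best)
print("E(U) at p = 0.4:  ", expected_utility(0.4))
print("all in B, all in A:", expected_utility(0.0), expected_utility(1.0))
```

The grid search and the quadratic agree on a weight close to 0.4, and the mixed portfolio has a higher expected utility than either security held alone.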



By considering a portfolio of investments, the range of investment
alternatives has been enlarged. By allowing short sales,
opportunities are increasingly enriched, and not only in the obvious way.
(A short sale allows an investor to profit from a fall in price in
precisely the same way he can profit from a rise in price. If he
sells short at $100 and it falls to $90, he gains $10; if it rises to
$110, he loses $10.) To illustrate the subtle range of opportunity
open to an investor able to engage in short sales, consider the
universe consisting of only one common stock and its associated
warrant. (A warrant is an option to buy a stock. The warrant is
bought and sold like other securities. Under certain conditions
the price of the warrant is determined completely by the price
of the stock. Therefore, if an investor has some subjective
expectation for the stock, this can be translated into a subjective
expectation for the warrant.) By buying various amounts of common
stock and warrants, an investor will have a large number of
possible portfolios. But by buying common and shorting warrants,
the set of possibilities is extended and it is very likely that in this
extension lies a portfolio that maximizes expected utility beyond
the supremum of the subset of portfolios consisting only of
purchases.

For concreteness, consider how an investor whose utility
function is log (x + 1) chooses a portfolio from a universe consisting
only of the ABC common and the ABC warrant. The present price
of the common is $10 and the warrant is $5. This investor’s capital
is $1500 and he wishes to select that portfolio which will maximize
his utility.

His price expectations can be summarized:

SUBJECTIVE      PRICE OF COMMON     RESULTING PRICE OF
PROBABILITY     STOCK AT TIME t     WARRANT AT TIME t

    1/4               10                    0
    1/4               15                    0
    1/4               20                    0
    1/4               25                    5
Assume first that this investor cannot sell short. Then he must
decide how to apportion his $1500 among common stock,
warrants, and cash. Since his expectations are that the warrant will
not be above 5 at time t, he will not purchase any warrants, for at
best he will show no profit. On the other hand, since he believes
that the common stock will not be below its present price at that
time, he will not wish to hold cash, for the worst that can happen
(in his view) if he holds only common is that he will show no
profit. (For convenience it is now assumed that the rate of interest
is 0.) Thus he will purchase 150 shares of common and the
possible outcomes, their probability, and his expected
utility become:

PRICE OF      PROFIT
COMMON        OUTCOME      PROBABILITY

   25          $2250           1/4
   20           1500           1/4
   15            750           1/4
   10              0           1/4

EXPECTED UTILITY OF PORTFOLIO:
1/4 log 751 + 1/4 log 1501 + 1/4 log 2251 = 2.3511



Assume now that this investor is able to sell short so that he may
apportion his $1500 between buying common and shorting
warrants. Let p be the proportion of his capital devoted to purchasing
common and 1 - p the proportion devoted to shorting warrants.
His profit is a function of the price of the common:

PRICE OF     PRICE OF                     PROFIT
COMMON       WARRANT      PROBABILITY     OUTCOME

   10           0             1/4         1500 - 1500p
   15           0             1/4         1500 - 750p
   20           0             1/4         1500
   25           5             1/4         2250p

Profit if p(1500) is invested in common at 10 and (1 - p)(1500) is invested
selling short warrants at 5.

His expected utility is

\[
E(U) = \tfrac{1}{4}\log(1501 - 1500p) + \tfrac{1}{4}\log(1501 - 750p) + \tfrac{1}{4}\log 1501 + \tfrac{1}{4}\log(2250p + 1) .
\]

Therefore, by setting dE(U)/dp = 0
it can be seen that his utility will be maximized when p is
approximately 0.42, so that this investor will purchase 63 shares of
common and sell short 174 warrants. With this portfolio the possible
outcomes, their probability, and the utility of the portfolio
become:

PRICE OF      PROFIT
COMMON        OUTCOME      PROBABILITY

   10            870           1/4
   15           1185           1/4
   20           1500           1/4
   25            945           1/4

EXPECTED UTILITY OF PORTFOLIO:
1/4 log 871 + 1/4 log 1186 + 1/4 log 1501 + 1/4 log 946 = 3.0416



Shorting warrants against purchases of common stock
resembles an arbitrage operation; instead of a guaranteed profit
regardless of the future course of prices, an investor who “hedges” by
selling warrants and purchasing common usually extends the range
in which future prices may profitably lie. This extension, coupled
with a suitable subjective probability distribution, may yield a
larger expected utility than otherwise possible. Warrants and other
convertible securities appeal to many investors for precisely this
reason. In the above example, by “hedging,” this investor was
able to increase his expected utility from 2.3511 to 3.0416.
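The hedged position can be checked the same way. The following sketch assumes, as the example does, that a fraction 1 - p of the $1500 supports a short sale of warrants at $5 and that interest is zero; the ternary search and the names used are illustrative choices, and the logarithm is taken to base 10 to match the quoted utilities.

```python
import math

CAPITAL, COMMON_NOW, WARRANT_NOW = 1500.0, 10.0, 5.0

# (subjective probability, price of common at time t, price of warrant at time t)
SCENARIOS = [(0.25, 10, 0), (0.25, 15, 0), (0.25, 20, 0), (0.25, 25, 5)]

def profit(p, common, warrant):
    """Profit when fraction p buys common and 1 - p is devoted to shorting warrants."""
    shares_long = p * CAPITAL / COMMON_NOW
    warrants_short = (1 - p) * CAPITAL / WARRANT_NOW
    return shares_long * (common - COMMON_NOW) + warrants_short * (WARRANT_NOW - warrant)

def expected_utility(p):
    """Subjective expected utility with U(x) = log10(x + 1)."""
    return sum(prob * math.log10(profit(p, c, w) + 1) for prob, c, w in SCENARIOS)

def argmax_concave(f, lo=0.0, hi=1.0, iterations=100):
    """Ternary search for the maximizer of a concave function on [lo, hi]."""
    for _ in range(iterations):
        m1 = lo + (hi - lo) / 3
        m2 = hi - (hi - lo) / 3
        if f(m1) < f(m2):
            lo = m1
        else:
            hi = m2
    return (lo + hi) / 2

p_best = argmax_concave(expected_utility)
print("best p:", p_best)
print("E(U) at p = 0.42:", expected_utility(0.42))
print("E(U) unhedged (p = 1):", expected_utility(1.0))
```

Ternary search is adequate here because E(U) is a sum of logarithms of linear functions of p and is therefore concave on the interval.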

Note that utility maximization resulted in a diversified
portfolio, that is, a portfolio that did not specialize in any one
security. If instead of maximizing utility an individual attempted
to maximize expected return, he would simply choose that security
which had the highest expected return. This behavior is rarely
observed. Again, utility maximization, although a normative
concept, may also provide the basis for explaining how individuals
actually behave.
efficient portfolios

In the real world, where thousands of securities are
available to an individual investor, consciously maximizing a
utility function is an almost impossible job. For this reason other
techniques have been proposed. One concept that has given rise
to much empirical and theoretical research is that of an “efficient
portfolio,” due to Markowitz (see Reference 1).

The riskiness of an asset, or a portfolio, hypothesized
Markowitz, can be expressed by a measure called the variance of the
return. Consider the following two securities with their subjective
probability schedules:

SECURITY     RETURN     PROBABILITY

   A            0%          1/2
              +20%          1/2

   B          -20%          1/2
              +40%          1/2


Both securities have an expected return of 10 per cent but
Security B has a larger range of possibilities; it is possible to lose
more and gain more with B than with A, although on average
both will return 10 per cent.

Suppose then, that investors consider not only expected return,
but the variance of that return;1 that for securities with the same
expected return, the one with the least variance will be most
preferred. For example, in the previous example, Security A will be
preferred to B, and in general, of all the portfolios offering the
same expected return, that portfolio with least variance will be
most preferred. Similarly, for all portfolios with the same variance,
that one with greatest expected return will be most preferred.

1 The variance of a probability distribution is defined as the expected value
of the squared differences from the mean. In symbols, var (X) =
E{[X - E(X)]^2}.
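The comparison of Securities A and B can be made concrete by computing the mean and variance of each return schedule. This is a small illustrative sketch; the function names are chosen here and returns are expressed in per cent.

```python
def mean(dist):
    """Expected value of a discrete distribution given as (probability, value) pairs."""
    return sum(p * x for p, x in dist)

def variance(dist):
    """Expected squared difference from the mean: var(X) = E{[X - E(X)]^2}."""
    m = mean(dist)
    return sum(p * (x - m) ** 2 for p, x in dist)

security_a = [(0.5, 0.0), (0.5, 20.0)]
security_b = [(0.5, -20.0), (0.5, 40.0)]

for name, dist in (("A", security_a), ("B", security_b)):
    print(name, "mean:", mean(dist), "variance:", variance(dist))
```

Both schedules have the same mean, so under the mean-variance criterion the choice between them turns entirely on the variance, which is nine times larger for B.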
In Fig. 5-1, the shaded area represents all the possible portfolios
available to an investor. The heavy curved line represents all the
efficient portfolios in this set. For example, portfolio P has the
smallest variance of all those portfolios with an expected return
of 10 per cent. If expected return and variance are the only two
characteristics that determine the suitability of a portfolio, then
this investor may be willing to trade off higher return against
higher variance, that is, he will accept a portfolio with larger
variance but he must be compensated with a larger return. This
suggests that he has an indifference map as indicated by the
dashed lines in Fig. 5-1. The lower the curve, the more preferred
the portfolios thereon. In thus attempting to maximize this utility
function of expected return and variance, he will select that
portfolio, of those available to him, that is tangent to his lowest
indifference curve. He will select portfolio Q.

This technique has proved popular because many computational
algorithms exist that can determine the efficient portfolios
(given the investor’s subjective expectations). Thus, when the
investor is confronted with this efficient set, his area of choice having been
limited enormously, it is presumably easier to decide on the best
portfolio.

What is the connection between efficient portfolio selection and
utility maximization in general? It can be shown (see Ref. 1,
pp. 286-87) that they are entirely equivalent in the case where the
individual’s utility function for gain is a quadratic function of the
form U(R) = aR + bR^2 + c. Of course many curvilinear relations
can be well approximated by a quadratic function, and in these
cases, efficient portfolio selection can be a very useful
approximation to utility maximization. Thus portfolio-selection theory may
be viewed as an improvement over the criterion of maximizing
expected value, which is equivalent to maximizing a linear utility
function; the improvement is due to the introduction of quadratic
rather than linear utility functions.
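The reason a quadratic utility function leads to the mean-variance criterion can be seen in one line. The derivation below is an added sketch using the definition of variance given in the footnote above; the symbols \(\mu\) and \(\sigma^2\) for the mean and variance of the return R are notation introduced here.

```latex
\[
E[U(R)] = a\,E(R) + b\,E(R^{2}) + c
        = a\,\mu + b\,(\sigma^{2} + \mu^{2}) + c ,
\qquad \mu = E(R), \quad \sigma^{2} = \operatorname{var}(R) .
\]
```

Expected utility therefore depends on the distribution of R only through its mean and variance. When b is negative, a larger variance lowers expected utility at a given mean, which matches the preference for the least variance among portfolios of equal expected return.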

But we must not forget that the quadratic utility function has
many unpleasant properties:

1. Beyond some finite gain it is monotonic decreasing, implying
negative marginal utility.

2. An increase in wealth, no matter how great, will not result
in the increase of purchase of a risky asset, characterizing
risky assets as inferior goods.2

3. It cannot discriminate between skewed distributions; a
quadratic utility function ranks the following two assets as equally
desirable: Asset A with 0.1 probability of 80 per cent loss
and 0.9 probability of 20 per cent gain, and Asset B with
0.1 probability of 100 per cent gain and 0.9 probability of
0 per cent gain. Both assets have expected value of 10 per
cent and standard deviation of return of 30 per cent, but
most investors would not view them as equally desirable.3

2 See John Pratt, “Risk Aversion in the Small and in the Large,”
Econometrica, 32, No. 1-2 (1964), pp. 122-136.
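The third property can be verified directly for Assets A and B. In the sketch below the coefficients of the quadratic utility function are hypothetical values chosen only for illustration, and skewness is measured in the usual way as the third central moment divided by the cube of the standard deviation.

```python
def central_moment(dist, k):
    """k-th central moment of a discrete distribution of (probability, return %) pairs."""
    mean = sum(p * x for p, x in dist)
    return sum(p * (x - mean) ** k for p, x in dist)

def expected_quadratic_utility(dist, a=1.0, b=-0.002, c=0.0):
    """E[aR + bR^2 + c]; the coefficients a, b, c are hypothetical."""
    return sum(p * (a * x + b * x * x + c) for p, x in dist)

asset_a = [(0.1, -80.0), (0.9, 20.0)]   # small chance of a large loss
asset_b = [(0.1, 100.0), (0.9, 0.0)]    # small chance of a large gain

for name, dist in (("A", asset_a), ("B", asset_b)):
    mean = sum(p * x for p, x in dist)
    std = central_moment(dist, 2) ** 0.5
    skewness = central_moment(dist, 3) / std ** 3
    print(name, mean, std, skewness, expected_quadratic_utility(dist))
```

The two assets agree in mean, standard deviation, and expected quadratic utility, yet their skewness has opposite signs: A carries the risk of a severe loss while B offers the chance of a large gain.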

PROBLEMS 

1. Calculate and draw the efficiency set of portfolios consisting of cash
and one asset which has an expected return of 10 per cent and a
variance of return of 10 per cent.

2. The same as Problem 1 but instead of cash, consider a riskless
investment (say government bonds) with zero variance and return of 3 per
cent.

3. Show that if 2 assets have independent returns, even though one
asset may have a higher return and a lower variance, it is possible
that some combination is more efficient than the superior asset.

REFERENCES 

1. MARKOWITZ, H. M., Portfolio Selection. New York: John Wiley &
Sons, Inc., 1959. Markowitz wrote the seminal article on port-

2. SHARPE, W. F., “A Simplified Model for Portfolio Analysis,” Manage-
computational simplification that makes the Markowitz model
accessible.

3. Journal of Financial and Quantitative Analysis, June 1967. This
by W. F. Sharpe, B. A. Wallingford, P. A. Samuelson, E. F.

3 This example is taken from Howard Raiffa’s brilliant Decision Analysis
(Reading, Mass.: Addison-Wesley Publishing Company, Inc., 1968),


IN THE ABSENCE 
OF PROBABILITIES 

CHAPTER SIX. 


The decision situation can be generalized in many directions. Until
now it has been assumed that the decision maker has clear-cut
preferences among strategies. Let us continue to assume that his
preferences among pure strategies (strategies leading to unequivocal
outcomes) are still consistent and transitive, but that when he
leaves the world of certainty he cannot express consistent preferences
among strategies because of his total ignorance of the states
of the world. For example, upon arriving in a foreign country
about which he knows nothing, a decision maker may be asked
to choose from a list of 5 candidates the one most likely to win
an election. Because he has no prior knowledge or experience in
this situation he may not be able to express any preference among
lotteries involving the candidates. Thus we will not be able to
deduce his subjective probabilities for the states of the world.

As might be expected, there is no completely satisfactory way 
to handle these situations. After all, if the decision maker does 
not know his own mind, it is difficult to advise what is “best” for 
him. But because many decision makers perceive themselves to be 
in exactly this position, many criteria have been proposed for 
making choices in this situation. We will examine some of these 
criteria along with their shortcomings. 


the principle of indifference 

If there are n possible states of the world and a decision
maker has no prior knowledge, intuition, or experience concerning
these states, then perhaps he should assign equal probability to
each state, i.e., assign probability 1/n to each state of the world,
and then select that strategy with greatest expected utility. This is
the criterion dictated by the so-called principle of indifference; it
is also called the Laplace criterion. At first blush, this is intuitively
compelling. But it has been under constant attack since first proposed
by James Bernoulli. “No other formula in the alchemy of
logic has exerted more astonishing powers. For it has established
the existence of God from the premise of total ignorance.” 1
A simple example will indicate the difficulties to which this
criterion may lead. Suppose you arrived in a country of which you
know nothing and are given the following facts: the population
ranges in age from 0 to 100, and everyone’s income is exactly the
square of his age. In particular, you have no knowledge concerning
the distribution of ages. A name is selected at random from
the census files and you wish to assign a probability that the person
selected is less than 50 years of age. The indifference principle
leads you to assign probability 1/2. Call this Situation 1.

1 J. M. Keynes, A Treatise on Probability (New York: Harper and Row,
Publishers, 1962), p. 82.

Suppose instead that you wished to assign a probability to the
statement that the selected individual had an income of less than
$5,000. You know only that incomes range from $0 to $10,000.
The indifference principle leads you to assign probability 1/2 to this
statement. Because of the connection between age and income, this
is entirely equivalent to saying that the probability is 1/2 that the
selected individual is less than 70.7107 years of age (70.7107² =
5,000). This contradicts the assignment of probabilities arrived
at in Situation 1, under which the probability of being less than
70.7107 years of age would be about 0.71.
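The contradiction can be made concrete with a few lines of arithmetic. The sketch below is an illustration only; the function name `prob_age_below` and its `uniform_on` argument are hypothetical. It evaluates the same events under a uniform distribution on age and under a uniform distribution on income, using the stated relation income = age².

```python
import math

MAX_AGE = 100.0
MAX_INCOME = MAX_AGE ** 2  # income is the square of age, so incomes run from 0 to 10,000

def prob_age_below(age, uniform_on):
    """P(selected person's age < `age`) when either age or income is taken as uniform."""
    if uniform_on == "age":
        return age / MAX_AGE
    if uniform_on == "income":
        # age < a is the same event as income < a**2
        return (age ** 2) / MAX_INCOME
    raise ValueError("uniform_on must be 'age' or 'income'")

age_for_5000 = math.sqrt(5000.0)                        # about 70.7107 years

situation_1 = prob_age_below(50.0, "age")               # indifference applied to age
situation_2 = prob_age_below(age_for_5000, "income")    # indifference applied to income
cross_check_1 = prob_age_below(age_for_5000, "age")     # same event, age-uniform view
cross_check_2 = prob_age_below(50.0, "income")          # same event, income-uniform view

print(situation_1, round(situation_2, 4), round(cross_check_1, 4), cross_check_2)
```

Each event receives a different probability depending on which variable is treated as uniform, which is exactly the inconsistency the text describes.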

Figure 6-1 illustrates the difficulty. This criterion asserts that
since nothing is known about the distribution of age in this
country, we should assume the uniform distribution, that is, the
distribution which assigns equal probability to every age. Similarly,
since nothing is known about the distribution of income, we
should assign the uniform distribution to income. But since age
and income are not related linearly, a uniform probability distribution
for age implies a nonuniform probability distribution
for income, and vice versa.

FIG. 6-1

In spite of this and other difficulties (Keynes, ibid., Chap. 4), this
criterion has many supporters. The nub of the difficulty lies in
defining states of the world that have equal probability. If the
decision maker truly believes that the states are equally probable,
then he has specific consistent preferences among strategies involving
lotteries and he is not in a state of “complete ignorance.” On
the other hand, if he is applying equal probabilities only because
of his ignorance, then he must not lose sight of the possibility of
paradoxical behavior.

the maxi-min criterion 

A defensive, pessimistic decision maker, perhaps one
who has never won a contest, invariably finds himself in the wrong
line at supermarkets, and whose choice of entree is always less
appetizing than his wife’s, might decide on the following criterion:
for each available strategy consider only the worst possible outcome,
then choose that strategy with the best of these worst
outcomes, that is, maximize the minimum outcome. This is the
maxi-min criterion. This criterion implies that the world is against
the decision maker. No matter which strategy he selects, he believes
he will be dealt the worst possible outcome. Under most circumstances,
this appears to be too cautious a stance.

the maxi-max criterion 

An incurable optimist, one who conveniently forgets all
his losses and remembers only his gains in the stock market, one
whose recollections of Las Vegas consist entirely of those winning
passes, one whose faith in the goodness of the world is unbounded,
might choose the following sanguine criterion: For each available
strategy consider only the best possible outcome; then choose that
strategy with the best of these best outcomes, that is, maximize
the maximum outcome. This is the maxi-max criterion. Most decision
makers would consider this too audacious.


the hurwicz-optimism index

The maxi-min and maxi-max criteria are extreme cases
of a more generalized criterion. Hurwicz has asked, why concentrate
on only the best outcome (maxi-max) or worst outcome
(maxi-min) associated with each strategy? By taking some
weighted average of the best and worst outcomes he proposed an
index of optimism that would not necessarily categorize the decision
maker as a hopeless pessimist or total optimist. To illustrate
his criterion, consider the following decision situation where the
entries in the matrix represent the utility of the payoff:

MATRIX 6.1:

        W1      W2      W3

S1      10       0     -1.2
S2       8       4      0
S3       3       3      3


A decision maker using the maxi-max criterion would select S1,
since this strategy will yield the greatest payoff if God is in His
heaven. On the other hand, the maxi-min criterion would select S3,
since the worst the decision maker can experience is a payoff of 3,
which is greater than the other “worsts.” Hurwicz argues that each
individual’s temperament probably lies somewhere between total
optimism and pessimism. This can be expressed by looking at the
best and worst outcomes associated with each strategy and taking
some weighted average. Call the weight given to the worst outcome
α, a number between 0 and 1, so that the weight given
to the best outcome is 1 − α. For example, if α = 0.2, we weight
the best and worst outcomes associated with strategy 1 by:
0.2(−1.2) + 0.8(10) = 7.76, which is called the α index of S1.
Thus, with each strategy is associated an α index. The criterion
then dictates that the strategy with the highest index should be
selected. Note that if α = 0, this criterion is equivalent to the
maxi-max criterion and if α = 1, it is equivalent to the maxi-min
criterion.
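The three criteria discussed so far can be sketched in a few lines. This is an illustrative sketch only: the payoffs are the entries of Matrix 6.1 as read from the text, and the names `alpha_index` and `hurwicz_choice` are assumptions introduced here, not the author's notation.

```python
# Utilities from Matrix 6.1: one list per strategy, one entry per state W1..W3.
PAYOFFS = {
    "S1": [10, 0, -1.2],
    "S2": [8, 4, 0],
    "S3": [3, 3, 3],
}

def alpha_index(outcomes, alpha):
    """Weight the worst outcome by alpha and the best outcome by (1 - alpha)."""
    return alpha * min(outcomes) + (1 - alpha) * max(outcomes)

def hurwicz_choice(payoffs, alpha):
    """Return the strategy with the highest alpha index."""
    return max(payoffs, key=lambda s: alpha_index(payoffs[s], alpha))

index_s1 = alpha_index(PAYOFFS["S1"], 0.2)     # 0.2(-1.2) + 0.8(10)
maxi_max_choice = hurwicz_choice(PAYOFFS, 0.0)  # alpha = 0: look only at the best outcome
maxi_min_choice = hurwicz_choice(PAYOFFS, 1.0)  # alpha = 1: look only at the worst outcome

print(round(index_s1, 2), maxi_max_choice, maxi_min_choice)
```

Setting alpha to 0 or 1 reproduces the maxi-max and maxi-min selections described above, which is why those two criteria are called extreme cases of the index.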

Let m_i be the worst outcome possible if S_i is chosen and M_i
the best outcome. Then the α index associated with each S_i is

α m_i + (1 − α) M_i = M_i + α(m_i − M_i).

In the above example, the α indices would be

S1: 10 − 11.2α
S2: 8 − 8α
S3: 3

The α index is a linear function of α. For the above situation,
the graphs of the indices are plotted in Fig. 6-2. If the decision
maker’s α is less than 5/8, then S1 is the superior strategy; if α is
greater than 5/8, S3 is the superior strategy. For no value of α is S2
a preferred strategy. Therefore, in running the gamut from pessimist
(α = 1) to optimist (α = 0), the Hurwicz criterion would
never rank strategy 2 as the superior choice. We next consider a
criterion that does choose S2 as the superior strategy, indicating
the not so satisfactory state of affairs when probabilities cannot
be assigned to the states of the world.
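The crossover value of α can be derived from the indices themselves: setting 10 − 11.2α = 3 gives α = 7/11.2 = 5/8, and at that same value 8 − 8α also equals 3, so all three lines meet at a single point. The sketch below checks this with exact rational arithmetic; the function name `strict_winner` and the grid of test values are illustrative assumptions.

```python
from fractions import Fraction

# Utilities from Matrix 6.1, held as exact fractions to avoid rounding at the crossover.
PAYOFFS = {
    "S1": [Fraction(10), Fraction(0), Fraction("-1.2")],
    "S2": [Fraction(8), Fraction(4), Fraction(0)],
    "S3": [Fraction(3), Fraction(3), Fraction(3)],
}

def alpha_index(outcomes, alpha):
    """M + alpha * (m - M), with m the worst and M the best outcome."""
    worst, best = min(outcomes), max(outcomes)
    return best + alpha * (worst - best)

def strict_winner(alpha):
    """Return the single strategy with the highest index, or None if there is a tie."""
    indices = {s: alpha_index(o, alpha) for s, o in PAYOFFS.items()}
    top = max(indices.values())
    leaders = [s for s, v in indices.items() if v == top]
    return leaders[0] if len(leaders) == 1 else None

# Solve 10 - 11.2*alpha = 3 for the point where the S1 and S3 lines cross.
crossover = Fraction(10 - 3) / Fraction("11.2")

# Scan alpha from 0 to 1 in steps of 0.001 and record whether S2 ever wins outright.
grid = [Fraction(k, 1000) for k in range(1001)]
s2_ever_wins = any(strict_winner(a) == "S2" for a in grid)

print(crossover, s2_ever_wins)
```

On this grid S2 never has the strictly highest index, in agreement with the statement that the Hurwicz criterion never ranks strategy 2 first.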


FIG. 6-2


the savage regret criterion

Investors in Wall Street must constantly select strategies.
Judging by a small biased sample of conversations at cocktail
parties it appears that often the buyer of a stock will elect to sell
his shares after they have moved up, perhaps to double their cost
price, only to see them move up even more sharply after they have
been sold. If the proceeds of the sale have been reinvested in a
situation that remains relatively stagnant, the investor experiences
definite pain and discomfort; hence the many tales of how rich an
investor would have been if he hadn’t sold his IBM stock. Savage
(Ref. 4 on p. 54) attempts to deal with the anguish resulting from
guessing wrong, or choosing the wrong strategy. Consider once
more the decision situation given by Matrix 6.1. Assume the true
state of the world is W1. If S2 is selected, the decision maker may
regret that he did not choose S1, for then he would have gained a
utility of 10 rather than 8. If he selected S3, his regret would be
stronger: there was a possibility of gaining a utility of 10 rather
than the 3 he experienced by choosing S3. Let a_ij measure his
regret by the difference between his maximum utility given the ith
state of the world and the utility received from selecting S_j. The
resulting matrix of these differences is called the regret matrix. The
regret matrix for Matrix 6.1 is shown as Matrix 6.2.

MATRIX 6.2: Savage Regret Matrix for Decision Situation of Matrix 6.1

        W1      W2      W3

S1       0       4      4.2
S2       2       0      3
S3       7       1      0


In a sense, the entries in the regret matrix represent the “cost”
of guessing nonoptimally. A decision maker is concerned with
minimizing his cost. If he were cautious (or pessimistic, or overly
defensive) he might decide to minimize his maximum possible loss.
That is, he would rank each strategy by its maximum possible
loss and select that strategy with the smallest such maximum.
Thus S1 has a maximum possible loss of 4.2, S2 of 3, and S3 of 7.
This would rank S2 as more preferred than S1, which in turn is
more preferred than S3. This procedure of selecting the strategy
which minimizes the maximum possible loss (regret) is called the
Savage regret criterion.
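The construction of the regret matrix and the selection rule can be sketched as follows. The payoffs are those of Matrix 6.1; the function names `regret_matrix` and `minimax_regret` are illustrative assumptions rather than terms from the text.

```python
# Utilities from Matrix 6.1: one list per strategy, one entry per state W1..W3.
PAYOFFS = {
    "S1": [10, 0, -1.2],
    "S2": [8, 4, 0],
    "S3": [3, 3, 3],
}

def regret_matrix(payoffs):
    """For each state, subtract each payoff from the best payoff available in that state."""
    n_states = len(next(iter(payoffs.values())))
    best = [max(row[j] for row in payoffs.values()) for j in range(n_states)]
    return {s: [best[j] - row[j] for j in range(n_states)] for s, row in payoffs.items()}

def minimax_regret(payoffs):
    """Return the strategy whose largest regret is smallest."""
    regrets = regret_matrix(payoffs)
    return min(regrets, key=lambda s: max(regrets[s]))

regrets = regret_matrix(PAYOFFS)
worst_regret = {s: max(r) for s, r in regrets.items()}
choice = minimax_regret(PAYOFFS)

for s, row in regrets.items():
    print(s, [round(x, 1) for x in row])
print(choice)
```

The computed rows agree with Matrix 6.2, and the strategy with the smallest maximum regret is S2, the one strategy the Hurwicz index never selects.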

Note that for the decision situation of Matrix 6.1, the maxi-min
criterion selects S3, the maxi-max criterion S1, and the Savage
regret criterion S2. To confuse matters further, there is no a priori
reason why the regret matrix could not be used with some sort
of α index, rather than resorting to the most pessimistic view.

Thus in the absence of probabilities it has not been possible to
demonstrate that there exists a “best” strategy for the decision
maker. The difficulty arises because we have defined “best” to
mean most preferred by the decision maker and in the case of
complete ignorance concerning the states of the world, the decision
maker is unable to rank all possible alternatives consistently. This
is an unhappy state for an “expert.” When convinced that the
decision maker is truly operating in complete ignorance of the
state of the world, he can offer conflicting advice depending upon
the criterion selected.

PROBLEMS 

1. Consider the following decision situation where nothing is known
about the probabilities of the states of the world.

        W1      W2      W3      W4

S1      12       3       6       6
S2      10       0      10      10
S3       5       4       5       5
S4      18       0       4       3


a. Which strategy would the Laplace criterion select; the maxi-max
criterion; the mini-max criterion?

b. Graph the α index for each strategy as a function of α. For
which values of α will S4 be selected?
FURTHER

GENERALIZATIONS


strategy selection under conflict 

In operating without probabilities, there is one special
case that has been examined extensively: decision situations
involving an opponent rather than a state of the world. This area of
decision making is known as game theory. We begin by considering
a 2-person game: a decision maker and one opponent. The
decision maker is player 1 and his opponent player 2. Assume
further that the decision maker and his opponent are in complete
conflict concerning the outcome. (At the other extreme both may
be in complete agreement, in which case the decision maker and
his opponent will obviously choose the strategy which simultaneously
maximizes each one’s utility.) If both players know all
strategies available to each other the game can be characterized
by a matrix. As an example:

MATRIX 7.1: A Game Between a Decision Maker and an Opponent

        W1      W2      W3      W4

S1
S2
S3


In Matrix 7.1 the decision maker must choose between strategies
S1, S2, and S3. His opponent must choose between W1, W2, W3, or
W4. The entries in the matrix represent the utility of the outcome
to the decision maker; since we assume this to be a case of complete
conflict, they also represent the disutility to the opponent. For
example, these entries might be the money payoff that the opponent
must make to the decision maker where both have linear
utility functions for money. Thus if the decision maker chooses S2
and his opponent W4, then the opponent pays the decision maker
the amount shown in row S2, column W4. This case of complete
conflict where one player’s gain is
exactly equal to the other’s loss is called a zero-sum game. When
facing an opponent rather than states of the world, the decision
maker may decide to select the strategy that will yield the greatest
return in view of the opponent’s possibilities. The military strategy
of the United States is said to be based on this criterion. 1 Rather
than try to predict what the enemy will do (attach probabilities
to his strategies), a military commander selects a strategy on the
basis of the punishment the enemy is able to inflict on his forces.
This is equivalent to selecting a strategy by invoking the maxi-min
criterion. It is important to keep in mind that nothing in game
theory points to the maxi-min criterion as the “best” method of
analysis for the decision maker; it is assumed that the decision
maker has already chosen this criterion. It has been previously
noted that this is a cautious, defensive posture. Under conflict with
a malevolent opponent in a zero-sum game this criterion has
much appeal. For the remainder of this chapter we assume that
this criterion is chosen to guide the decision maker.

If both players of the game depicted in Matrix 7.1 use the maxi-min
criterion what will be the outcome? First, the decision maker
will select S2 since this maximizes the minimum payoff, and he is
sure to receive at least 4. His opponent, wishing to minimize his
loss, will select W2 since this minimizes his maximum payoff. The
outcome will be that the decision maker receives 4 from his opponent.
Note that if the decision maker chooses S2 and his
opponent does not choose W2, the decision maker receives more
than 4; similarly if the opponent chooses W2 and the decision
maker does not choose S2 the opponent will have a loss of less
than 4. Since both players know each other’s alternatives, and
since both are invoking the cautious maxi-min criterion, there will
be no incentive for either of them to veer from choosing S2 and
W2. In this particular case, the outcome is said to be in equilibrium.
It is not difficult to show that an outcome is in equilibrium if, and
only if, it is the minimum entry in its row and the maximum entry
in its column. (See Prob. 1.)

A 2-person zero-sum game may not have an equilibrium outcome.
For example:

MATRIX 7.1: A Game with No Equilibrium Solution

        W1      W2

S1       3       1
S2       2       4

In this game, the decision maker would choose S2 by the maxi-min
criterion, since he is then assured a gain of at least 2. His
opponent would choose W1, since he is then assured of paying out
at most 3. But since the decision maker knows his opponent’s
choice based on the maxi-min criterion he would then be motivated
to choose S1 to increase his return to 3. Similarly, his
opponent can also follow this reasoning and so instead would
choose W2. As a result, there is no equilibrium outcome to this
game. But von Neumann’s celebrated theorem proves that there
is a pair of strategies that will be in equilibrium for every 2-person
zero-sum game if randomized strategies, rather than only pure
strategies, are allowed. A randomized strategy is one that places a
probability on each available strategy rather than a straight or
pure selection. For example, a randomized strategy might be:
choose S1 if the flip of an unbiased coin results in a head, otherwise
choose S2. The proof of this theorem and further consequences of
2-person zero-sum games will not be developed here. Instead we
consider next a generalization which seems to face many decision
makers.
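The equilibrium test stated earlier (an entry that is the minimum of its row and the maximum of its column) is easy to mechanize, and for a 2-by-2 game without such an entry the randomized equilibrium has a simple closed form. The sketch below is an illustration under stated assumptions: the 2-by-2 payoffs are those of the game with no equilibrium solution, the closed-form formulas are the standard textbook result for 2-by-2 zero-sum games and are not developed in this text, and the function names are hypothetical.

```python
from fractions import Fraction

# Payoffs to the decision maker: rows S1, S2; columns W1, W2.
GAME = [[3, 1],
        [2, 4]]

def saddle_points(matrix):
    """Cells that are the minimum of their row and the maximum of their column."""
    return [
        (i, j)
        for i, row in enumerate(matrix)
        for j, value in enumerate(row)
        if value == min(row) and value == max(r[j] for r in matrix)
    ]

def solve_2x2(matrix):
    """Randomized equilibrium of a 2x2 zero-sum game that has no saddle point.

    Returns (p, q, value): p is the probability of the first row, q the
    probability of the first column, value the expected payoff to the row player.
    """
    (a, b), (c, d) = matrix
    denom = a - b - c + d          # nonzero whenever there is no saddle point
    p = Fraction(d - c, denom)
    q = Fraction(d - b, denom)
    value = Fraction(a * d - b * c, denom)
    return p, q, value

pure_equilibria = saddle_points(GAME)
p, q, value = solve_2x2(GAME)
print(pure_equilibria, p, q, value)
```

For this particular game the decision maker's equilibrium mixture happens to be the coin flip used as an example in the text, while the opponent plays W1 three times out of four; either way the expected payment is 5/2.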


nonzero-sum games 

Zero-sum games do not capture the vitality of many
actual situations. With the possible exception of some parlor
games, one player’s gain seldom equals the other’s loss. Even in a
simple game between an adult and a child, the child’s gain may
actually represent a gain in utility for the adult who loses purposely.
This is an outcome in which both players gain a positive
utility.

There is one vexing nonzero-sum game that has received much
attention. It is called the prisoner’s dilemma. We illustrate this
situation with a different interpretation from the traditional one,
which gave rise to its name. Consider a “game” between two world
powers. The decision maker for one world power must choose
between the following two strategies:

S1: Disarm and divert resources to constructing water
desalinization plants

S2: Continue to arm at the present rate

His opponent, representing the other power, must choose between
W1 and W2, which represent the same strategies available to the
decision maker of Power 1. If the outcomes are described by the
billions of dollars gained or lost by each power, the game can be
represented by the following matrix:

MATRIX 7.2: Nuclear Power Nonzero-sum Game

             W1             W2

S1        (50, 50)      (-110, 60)
S2       (60, -110)     (-50, -50)

Since this is a nonzero-sum game, each entry in the matrix must
show the gain to each player. The first number in each parentheses
represents the gain to Power 1 and the second number the gain to
Power 2.

To rationalize these entries, let us consider the four possible
cases. (S1, W1): Both powers disarm. The payoff to each is $50
billion, the benefits of desalinization. (S1, W2): The decision maker
disarms and his opponent continues to arm. Power 2 takes over
Power 1 since it is now able to do so. The effect on Power 1 is a
loss of $110 billion, perhaps in goods and services it must give to
Power 2. The payoff to Power 2 is $60 billion, since it must still
subject Power 1 to military sanction and the $50 billion armament
cost deducted from the $110 billion leaves a payoff of $60 billion.
(S2, W1): The same as the previous case with the position of the
powers reversed. (S2, W2): Both continue to arm with each receiving
a negative payoff of $50 billion. In this last case, their
negative payoff is actually greater than the cost of armament since
now the probability of war increases, in which case both will suffer
additional expected costs. To illustrate the dilemma it is not necessary
to consider this additional cost.

The decision maker analyzes the situation in this manner: if the
opponent selects W1 my optimal strategy is S2 since 60 exceeds 50;
if the opponent selects W2 my optimal strategy is again S2. His
opponent, in a completely symmetrical argument, decides that his
optimal strategy is W2. Thus with 2 “rational” players, the outcome
of the game is a loss of $50 billion for each.
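The reasoning of each power can be checked mechanically. The following sketch is illustrative only: the payoffs are those of Matrix 7.2, and the function names `best_reply_row`, `best_reply_col`, and `is_dominated_outcome` are assumptions introduced here. It finds each player's best reply to every choice of the other and then tests whether the resulting outcome is worse for both than some other outcome.

```python
# Payoffs (Power 1, Power 2) in billions of dollars, from Matrix 7.2.
PAYOFFS = {
    ("S1", "W1"): (50, 50),
    ("S1", "W2"): (-110, 60),
    ("S2", "W1"): (60, -110),
    ("S2", "W2"): (-50, -50),
}
ROWS, COLS = ("S1", "S2"), ("W1", "W2")

def best_reply_row(col):
    """Power 1's best strategy if Power 2 is known to play `col`."""
    return max(ROWS, key=lambda r: PAYOFFS[(r, col)][0])

def best_reply_col(row):
    """Power 2's best strategy if Power 1 is known to play `row`."""
    return max(COLS, key=lambda c: PAYOFFS[(row, c)][1])

def is_dominated_outcome(cell):
    """True if another outcome is at least as good for both powers and differs from this one."""
    u = PAYOFFS[cell]
    return any(v[0] >= u[0] and v[1] >= u[1] and v != u for v in PAYOFFS.values())

row_replies = {c: best_reply_row(c) for c in COLS}
col_replies = {r: best_reply_col(r) for r in ROWS}
outcome = ("S2", "W2")

print(row_replies, col_replies, PAYOFFS[outcome], is_dominated_outcome(outcome))
```

Arming is the best reply to either choice by the other side, yet the outcome both arrive at is worse for each than mutual disarmament, which is the dilemma.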

The preceding analysis has assumed that there was no collusion
between both powers. If communication exists between the two
players it seems likely that they could reach an agreement of
bilateral disarmament, since this represents an outcome that is
better for both. If communication or cooperation is not possible
or if a similar situation involves “many” players, then the “best”
outcome for all players will probably not result, since it will then
be in each player’s self-interest to choose to arm. (This decision
situation is called the prisoner’s dilemma because in its original
formulation it involved 2 prisoners who were not in communication
with each other and who had to choose between confessing
or not confessing, realizing that the other was faced with exactly
the same strategies.)

In a many-person game involving the prisoner’s dilemma the
outcome will be suboptimal if each player seeks his own
self-interest. (Suboptimal means that there exists another outcome
which will improve at least one player’s payoff without reducing
that of any other player.) These games are “unfair” because if
some players act in the common good, but others do not, the
good guys finish last. In these situations, some apparatus is
needed to coerce each player into acting differently than
“rationality” dictates. In the “game” of international politics,
Kenneth Boulding makes an eloquent appeal for strengthening
supranational institutions to the point where they can coerce
players into optimal behavior. “Indeed, for the meagre resources
that we devote to the international order (e.g., the United
Nations), we get a remarkable return. This becomes apparent when
we consider that the combined budget for all the international
agencies is less than that of the Ford Foundation.” 2

collective decision making 

Individuals do not have a monopoly on decision making.
Groups of individuals (members of clubs, boards of directors,
citizens of a state) must frequently select a strategy from some
available set. If all members of the group have the same preferences
and probability estimates, then the situation is identical with
individual decision making: any member of the group can be
considered the decision maker for the group. His strategy selection
will be unanimously endorsed by the others.

In cases where the individual members of the group have different
preferences, the situation is quite different. A group may
select a strategy by tradition, in which people who have always
acted in a certain manner shall continue to do so; or by allowing
some one individual (father, chairman, king) to decide for the
group. We in the Western world are suspicious of this last procedure.
We are fearful that the individual deciding for the group
may not be benevolent or wise, or worse, that he may behave
selfishly. The “fair” solution that first comes to mind is to resort
to majority rule to express the preferences of the group. Unfortunately,
in a situation involving at least 3 strategies, majority rule
sometimes leads to inconsistent choices. For example, suppose a
group of three individuals, I1, I2, and I3, must choose a strategy
from the available set S1, S2, and S3. The following tabulation
shows the individual preferences of each individual, with his most
preferred strategy at the top of the column and his least preferred
at the bottom.

        I1      I2      I3

        S1      S2      S3
        S2      S3      S1
        S3      S1      S2

2 Kenneth E. Boulding, “The Learning and Reality-Testing Process in the
International System,” Journal of International Affairs, (1967), XXI,
pp. 1-15.

For example, Individual 3 prefers S3 to S1 and S1 to S2. If majority
rule is to dictate the group’s preferences among the strategies, we
have the following: S1 > S2 (where > again denotes preference)
since both I1 and I3 prefer S1 to S2; S2 > S3 since I1 and I2 prefer
S2 to S3; S3 > S1 since I2 and I3 prefer S3 to S1. This is an
intransitive ordering, because with S1 > S2 and S2 > S3, transitivity
requires that S1 > S3, which is not the case with majority
rule.

This intransitivity would lead to the choice of any one of the
strategies depending upon the order in which choice between pairs
is taken. For example, if the three individuals are asked to choose
first between S1 and S2, with the winner of this round to compete
with S3, the final choice will be S3. If, however, they are asked
first to choose between S1 and S3, with the winner pitted against S2,
the final choice will be S2. Similarly, if the first round is between
S2 and S3, the final choice will be S1. Thus the final choice of
strategy is completely arbitrary and depends upon the manner
in which the first round of candidates is presented. An intransitive
preference structure violates our “rationality” axioms because it
leads to inconsistencies of this type.
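The dependence of the final choice on the order of voting can be reproduced with a short simulation. This is a minimal sketch, assuming the three rankings as read from the tabulation above; the function names `majority_winner` and `agenda_outcome` are illustrative and do not come from the text.

```python
# Each individual's ranking, most preferred first.
RANKINGS = {
    "I1": ["S1", "S2", "S3"],
    "I2": ["S2", "S3", "S1"],
    "I3": ["S3", "S1", "S2"],
}

def majority_winner(x, y, rankings=RANKINGS):
    """Return whichever of x and y is ranked higher by a majority of individuals."""
    votes_for_x = sum(r.index(x) < r.index(y) for r in rankings.values())
    return x if votes_for_x > len(rankings) / 2 else y

def agenda_outcome(first_pair, challenger, rankings=RANKINGS):
    """Vote on first_pair, then pit the winner of that round against the challenger."""
    first_round = majority_winner(first_pair[0], first_pair[1], rankings)
    return majority_winner(first_round, challenger, rankings)

pairwise = {
    ("S1", "S2"): majority_winner("S1", "S2"),
    ("S2", "S3"): majority_winner("S2", "S3"),
    ("S1", "S3"): majority_winner("S1", "S3"),
}
outcomes = {
    "S1 vs S2 first": agenda_outcome(("S1", "S2"), "S3"),
    "S1 vs S3 first": agenda_outcome(("S1", "S3"), "S2"),
    "S2 vs S3 first": agenda_outcome(("S2", "S3"), "S1"),
}
print(pairwise)
print(outcomes)
```

Each of the three strategies wins under some agenda, so whoever sets the order of the votes in effect chooses the outcome.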

This is known as the Arrow Paradox, after Kenneth
Arrow (see Ref. 1). Arrow has shown that if a group’s preference
structure is to maintain certain “reasonable” conditions then we
cannot count on majority rule to lead to a “reasonable” preference
ranking for the group. We will not consider the “reasonable”
conditions that Arrow contends should be imposed on a group
preference, although they have been criticized and analyzed in
many places (see especially Ref. 3, Appendix 2). Instead, we will
examine the conditions necessary for majority rule to lead to a
reasonable solution, that is, a transitive group preference ranking.
Since majority rule is so firmly entrenched in our institutional
procedures it is well to know when it may lead to paradoxical
results.

Suppose an odd-numbered group had to select a strategy from
the available set of S1, S2, S3, and S4. There are 24 ways in which an
individual can rank these strategies. If some of these 24 ways
never arise, it can be shown that majority rule will lead to a transitive
group ordering of preference. A graph can illustrate which
individual preference rankings are admissible in order that majority
rule leads to rational collective preferences.

FIG. 7-1

In Fig. 7-1 the available strategies have been located on a linear
scale. If each individual in the group viewed each strategy as having
more or less of some attribute which determines its attractiveness
and the degree of this attribute determines the strategy’s
position on the line, the position of the four strategies might be
as indicated in Fig. 7-1. For example, the 4 strategies might represent
the choice of 1 of 4 candidates, with each candidate having
some amount of liberal quality: the further a candidate is to the
left on the line the more liberal he is, and the further to the
right the more conservative. Suppose Individual 1, I1, had some
ideal amount of liberalism or conservatism he wished to see in a
candidate, and ranked available candidates by the amount they
deviated from this ideal. In Fig. 7-1, suppose the point marked I1
represents this individual’s ideal. Then by projecting the position
of the available strategies upon the two 45° dashed lines emanating
from I1 we can indicate the preference structure of I1. If we number
his least preferred strategy 0 and use this as a base line from which
we measure his preference for the other strategies, we see that his
preference function indicated by the heavy line rises to a peak at
S2 and falls away monotonically from this peak. Similarly, if
Individual 2 has an ideal at the point marked I2, his preference
function will have a peak at S3 and fall away monotonically on
both sides. If all individuals in the group have single-peaked
preference functions, that is, have some ideal along the line and
rank strategies by their distance from the ideal, then Black (Ref. 2)
has shown that majority voting will lead to a transitive group
preference. Therefore in assessing the rationality of majority voting
it is helpful to determine if only single-peaked preference functions
characterize the individuals in the group.
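Black's condition can be illustrated numerically. In the sketch below the positions of the four strategies on the line and the three ideal points are hypothetical values chosen for illustration (the figure's actual coordinates are not recoverable from the text), and the function names are assumptions. Each individual ranks strategies by distance from his ideal, and the code then checks whether the majority relation is transitive.

```python
from itertools import permutations

# Hypothetical positions on the liberal-to-conservative line (not taken from Fig. 7-1).
POSITIONS = {"S1": 1.0, "S2": 3.0, "S3": 6.0, "S4": 8.0}
# Hypothetical ideal points for an odd-numbered group of three individuals.
IDEALS = [2.6, 5.2, 7.5]

def ranking_from_ideal(ideal, positions=POSITIONS):
    """Rank strategies by distance from the ideal point, closest first."""
    return sorted(positions, key=lambda s: abs(positions[s] - ideal))

def majority_prefers(x, y, rankings):
    """True if more than half of the rankings place x above y."""
    return sum(r.index(x) < r.index(y) for r in rankings) > len(rankings) / 2

def is_transitive(rankings, options):
    """Check that x > y and y > z under majority rule always implies x > z."""
    return all(
        majority_prefers(a, c, rankings)
        for a, b, c in permutations(options, 3)
        if majority_prefers(a, b, rankings) and majority_prefers(b, c, rankings)
    )

single_peaked = [ranking_from_ideal(x) for x in IDEALS]
cyclic = [["S1", "S2", "S3"], ["S2", "S3", "S1"], ["S3", "S1", "S2"]]

print(single_peaked)
print(is_transitive(single_peaked, list(POSITIONS)))
print(is_transitive(cyclic, ["S1", "S2", "S3"]))
```

With these assumed values the first individual's ranking peaks at S2 and the second's at S3, as in the text's description, and the majority ordering is transitive; the three-person tabulation used earlier for the paradox fails the same test.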

Musgrave (Ref. 6) has shown that if certain tax and expenditure 
decisions are decided by majority rule the outcome could be 
arbitrary. And yet political processes based upon majority rule do 
not appear to be chaotic, suggesting that individuals in these 
societies have fairly similar preferences, ranking their alternatives 
along some continuum. When the alternatives possess more than 
one attribute that is relevant to the ranking, that is, when the 
alternatives are positioned in more than one dimension, the 
arbitrary quality of majority rule is likely to rise. 

Majority rule carries with it the implicit assumption that each
voter’s satisfaction in winning is equal to every other voter’s. If
this is true, then majority rule achieves the greatest total satisfaction.
This assumption concerning the comparison of utilities between
individuals is suspect and questioned by many. Clearly,
intensity of preference must vary greatly when choosing between
some alternatives: I may barely prefer Candidate A to Candidate
B whereas you may violently oppose A and embrace B; but my
vote counts as much as yours.

Thus a political process using majority rule is a rather crude
way of registering the preferences of the individuals comprising a
group. This is especially so when the procedure is compared with
an economic solution in a free enterprise situation where each
individual’s preferences are registered very finely. Nevertheless, some
qualifications must be made about this comparison. First, since
voters with intense preferences are free to persuade, they may have
more influence than voters with weak preferences. Second, an
economic solution, although efficient, may be “unfair” because it
is dependent upon the distribution of income. For example, the
market solution in Saudi Arabia is not necessarily superior to a
nonmarket solution in a more egalitarian society. (For a thorough
treatment of the welfare aspects of the market solution see Ref. 5.)

In this chapter we have generalized the decision-making process 
in two ways. First, we considered an “opponent” rather than a 
neutral state of the world; only by imposing a further criterion, 
namely, the maxi-min criterion, could we specify the “best” course 
of action. Second, we considered the decision of a group. It was 
seen that majority rule is not a sine qua non for rational group 
decision making. There is at least one further generalization that 
could be considered: the case where the decision maker may not 
even know all the alternatives available to him. We have implicitly 
assumed throughout that the set of alternatives was known to the 
decision maker. In practice, this is seldom the case. 

An architect, confronted with a site, a family, and a budget, 
cannot even conceptually enumerate or define the alternative 
house designs available. Instead he searches for a solution as if 
moving in a maze: he arrives at cul-de-sacs and he backtracks, 
he arrives at junctures that do not seem promising and he retraces 
his steps to some previous position so that he can move out along 
a new path. At some point he stops and presents his solution. 
The solution may satisfy all the imposed constraints but it would 
be a miracle if it were “optimal.” This same difficulty faces city 
planners, political rulers, composers, and businessmen. Are there 
any principles or theories to guide us in these search procedures? 
This area is now receiving much attention and will probably 
have the greatest practical application. For a path-breaking 
discussion, see Ref. 8, Chap. 3. 

PROBLEMS 

1. Show that in a 2-person zero-sum game an outcome in the decision 
matrix is an equilibrium outcome if, and only if, it is the minimum 
value in its row and the maximum value in its column. 

2. Construct a game which has more than one equilibrium solution. 

3. In Fig. 7-1, suppose that the distance between S1 and S2 is 5; between 
S2 and S3 is 2; between S3 and S4 is 3. Write all the possible ways in 
which an individual can rank S1 through S4; which of these possible 
rankings are single-peaked? 
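
For checking answers to Problems 1 and 2, the condition stated in Problem 1 can be tested mechanically. The sketch below assumes the decision matrix is given as a list of rows of payoffs; the function name and the sample matrices are hypothetical. It is a checking aid only and does not supply the proof that Problem 1 asks for.

```python
def equilibrium_outcomes(matrix):
    """Return (row, column) positions whose entry is the minimum of its row
    and the maximum of its column."""
    points = []
    for i, row in enumerate(matrix):
        for j, value in enumerate(row):
            column = [r[j] for r in matrix]
            if value == min(row) and value == max(column):
                points.append((i, j))
    return points

# Sample 2 x 3 matrix with one equilibrium outcome, in row 0, column 1.
print(equilibrium_outcomes([[3, 1, 4], [2, 0, 5]]))  # [(0, 1)]

# A matrix with no equilibrium outcome in pure strategies.
print(equilibrium_outcomes([[1, -1], [-1, 1]]))       # []
```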